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(57) An information processing apparatus includes 
a generating unit for generating ClipMark formed by a 
mark specifying a characteristic picture extracted from 
an input AV stream, as management information for su- 
pervising the AV stream, and for generating PlayList- 
Mark, formed by marks pointing to a picture optionally 



specified by a user, from a reproduction domain corre- 
spondingto the PlayList defining the combination of pre- 
set domains in the AV stream, and a recording unit for 
recording the ClipMark and PlayListMark as respective 
independent tables on a recording medium. With the in- 
formation processing apparatus, it is possible to access 
a desired location of an AV stream promptly and reliably. 
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Description 

Technical Field 

5 [0001] This invention relates to an information processing method and apparatus, program and recording medium. 
More particularly, it relates to an information processing method and apparatus, program and recording medium in 
which a desired location of an AV stream can be accessed extremely promptly. 

Background Art 

10 

[0002] Recently, a variety of types of optical discs have been proposed as a recording medium that can be removed 
from a recording apparatus. These recordable optical discs have been proposed as a large capacity medium of several 
GBs and are thought to be promising as a medium for recording AV (audio visual) signals, 

[0003] Among the sources (supply sources) of digital AV signals, recorded on the recordable optical disc, there are, 
15 for example, a bitstream the recording apparatus itself forms on picture compression of analog input audio video signals 
in accordance with the MPEG-2 system, and a bitstream of the MPEG-2 system, directly obtained from electrical waves 
of digital television broadcast. In general, an MPEG-2 transport stream is used in digital television broadcast. A transport 
stream is composed of plural concatenated transport packets. Each transport packet is a packetized MPEG-2 video 
stream or MPEG-1 audio stream, with the data length of each transport packet being 188 bytes in length. If an AV 
20 program of a transport stream, received over digital television broadcast, is directly recorded on an optical disc by a 
recorder, recording can be made without deteriorating the video or audio quality whatsoever. 

[0004] In order to permit a user to search a scene of interest, such as a program locating point, in the transport 
stream recorded on the optical disc, it is required to be able to perform random-access reproduction. 
[0005] In general, in an MPEG-2 video stream, l-pictures are encoded at an interval of 0.5 sec, whilst other pictures 
25 are encoded as P- or B-pictures. Thus, in reproducing the video by random-accessing a transport stream recorded on 
the optical disc, l-pictures must be searched first. 

[0006] However, if a transport stream, recorded on the optical disc, is random-accessed for video reproduction, it is 
difficult to search the starting byte of an i-picture efficiently. That is, the syntax of the video stream needs to be analyzed 
from a random byte position of a transport stream on the optical disc to search the start byte of the l-picture. So, the 
30 |-picture searching is extremely time-consuming, such that it is difficult to perform random-access reproduction with 
quick response to the user input. 

Disclosure of the Invention 

35 [0007] It is therefore another object of the present invention to enable decision of the position of transport stream 
readout from the recording medium and stream decoding start quickly responsive to a random-access reproduction 
command of a user. 

[0008] In one aspect, the present invention provides an information processing apparatus including generating 
means for generating ClipMark formed by a mark specifying a characteristic picture extracted from an input AV stream, 
40 as management information for supervising the AV stream, and for generating PlayListMark, formed by marks pointing 
to a picture optionally specified by a user, from a reproduction domain corresponding to the PlayList defining the com- 
bination of preset domains in the AV stream, and recording means for recording the ClipMark and PlayListMark as 
respective independent tables on a recording medium. 

[0009] The generating means generates the ClipMark and the PlayListMark as a Clipl nformation file and as a PlayList 
45 file, respectively. 

[0010] The PlayListMark may further include a mark specifying a Resume point in reproducing the PlayList. 
[0011] In reproducing the PlayList, reference may be made to the mark forming the ClipMark of the AV stream cor- 
responding to the reproduction domain of the PlayList. . 

[0012] The mark of the PlayListMark may include presentation time stamp and the identification information for in- 
50 dicating one reproduction domain specified on the AV stream data forming the reproduction route of the PlayList. 

[0013] The mark forming the ClipMark or the mark forming the PlayListMark may include the information specifying 
an entry point of an elementary stream. 

[0014] The mark of the PlayListMark may include the information of the type at least including the Resume point of 
the start point of a favorite scene or a Resume point of the PlayList specified by a user. 
55 [0015] The mark of the ClipMark and the mark of the PlayListMark may be represented by an address of a relative 
source packet corresponding to an entry point of the AV stream. 

[0016] The mark of the ClipMark and the mark of the PlayListMark may be represented by a first address of a relative 
source packet corresponding to an entry point of the AV stream and by a second address of an offset from the first 
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address, respectively. 

[0017] The information processing apparatus may further include type detection means for detecting the type of the 
characteristic picture detected at the time of recording by the first recording means, with the first recording means then 
recording the mark forming the ClipMark and the type as detected by the detection means. 

[0018] The mark of the ClipMark may include a scene change point, a commercial start point, commercial end point 
or a title displaying scene. 

[0019] In another aspect, the present invention provides an information processing method including a step of gen- 
erating ClipMark formed by a mark specifying a characteristic picture extracted from an input AV stream, as manage- 
ment information for supervising the AV stream, and of generating PlayListMark, formed by marks pointing to a picture 
optionally specified by a user, from a reproduction domain corresponding to the PlayList defining the combination of 
preset domains in the AV stream, and a step of recording the ClipMark and PlayListMark as respective independent 
tables on a recording medium. 

[0020] In still another aspect, the present invention provides a recording medium having recorded thereon a com- 
puter-readable program, the computer-readable program including a step of generating ClipMark formed by a mark 
specifying a characteristic picture extracted from an input AV stream, as management information for supervising the 
AV stream, and of generating PlayListMark, formed by marks pointing to a picture optionally specified by a user, from 
a reproduction domain corresponding to the PlayList defining the combination of preset domains in the AV stream, and 
a step of recording the ClipMark and PlayListMark as respective independent tables on a recording medium. 
[0021] In still another aspect, the present invention provides a program for having a computer execute a step of 
generating ClipMark formed by a mark specifying a characteristic picture extracted from an input AV stream, as man- 
agement information for supervising the AV stream, and of generating PlayListMark, formed by marks pointing to a 
picture optionally specified by a user, from a reproduction domain corresponding to the PlayList defining the combination 
of preset domains in the AV stream, and a step of recording the ClipMark and PlayListMark as respective independent 
tables on a recording medium. 

[0022] In still another aspect, the present invention provides a information processing apparatus including readout 
means for reading the management information for supervising the AV stream including ClipMark formed by marks 
pointing to a characteristic picture extracted from an AV stream, and also for reading, from a reproduction domain 
corresponding to the PlayList defining the combination of preset domains in the AV stream, PlayListMark formed by a 
mark pointing to a picture optionally specified by a user, presentation means for presenting the management information 
read out by the readout means and the information by the PlayListMark, referencing means for referencing, from the 
information presented by the presentation means, the ClipMark corresponding to the PlayList commanded to be re- 
produced by the user and reproducing means for reproducing the AV stream including the ClipMark, referenced by the 
referencing means, from a position corresponding to the ClipMark. 

[0023] The presentation means may present a list by the thumbnail picture associated with the PlayListMark. 
[0024] The mark corresponding to the ClipMark and the mark corresponding to the PlayListMark are represented by 
an address of a relative source packet corresponding to an entry point of the AV stream. 

[0025] The mark of the ClipMark and the mark of the PlayListMark may be represented by a first address of a relative 
source packet corresponding to an entry point of the AV stream and by a second address of an offset from the first 
address, respectively. 

[0026] The mark of the ClipMark may include a scene change point, a commercial start point, a commercial end 
point or a title displaying scene. 

[0027] In still another aspect, the present invention provides an information processing method including a readout 
step of reading the management information for supervising the AV stream including ClipMark formed by marks pointing 
to a characteristic picture extracted from an AV stream, and also for reading, from a reproduction domain corresponding 
to the PlayList defining the combination of preset domains in the AV stream, PlayListMark formed by a mark pointing 
to a picture optionally specified by a user, a presentation step of presenting the management information read out by 
the readout step and the information by the PlayListMark, a referencing step of referencing, from the information pre- 
sented by the presentation step, the ClipMark corresponding to the PlayList commanded to be reproduced by the user, 
and a reproducing step of reproducing the AV stream including the ClipMark, referenced by the referencing step, from 
a position corresponding to the ClipMark. 

[0028] In another aspect, the present invention provides a recording medium having recorded thereon a computer- 
readable program, the computer-readable program including a readout step of reading the management information 
for supervising the AV stream including ClipMark formed by marks pointing to a characteristic picture extracted from 
an AV stream, and also for reading, from a reproduction domain corresponding to the PlayList defining the combination 
of preset domains in the AV stream, PlayListMark formed by a mark pointing to a picture optionally specified by a user, 
a presentation step of presenting the management information read out by the readout step and the information by the 
PlayListMark, a referencing step of referencing, from the information presented by the presentation step, the ClipMark 
corresponding to the PlayList commanded to be reproduced by the user and a step of recording the ClipMark and 
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PlayListMark as respective independent tables on a recording medium. 

[0029] In still another aspect, the present invention provides a program for having a computer execute a readout 
step of reading the management information for supervising the AV stream including ClipMark formed by marks pointing 
to a characteristic picture extracted from an AV stream, and also for reading, from a reproduction domain corresponding 

5 to the PlayList defining the combination of preset domains in the AV stream, PlayListMark formed by a mark pointing 
to a picture optionally specified by a user, a presentation step of presenting the management information read out by 
the readout step and the information by the PlayListMark, a referencing step of referencing, from the information pre- 
sented by the presentation step, the ClipMark corresponding to the PlayList commanded to be reproduced by the user, 
and a step of recording the ClipMark and PlayListMark as respective independent tables on a recording medium. 

io [0030] In yet another aspect, the present invention provides a program having recorded thereon, as independent 
tables, the management information for supervising the AV stream including ClipMark formed by marks pointing to a 
characteristic picture extracted from an AV stream, and also for reading, from a reproduction domain corresponding 
to the PlayList defining the combination of preset domains in the AV stream, PlayListMark formed by a mark pointing 
to a picture optionally specified by a user. 

*5 [0031] With the information processing method and apparatus and the program, according to the present invention, 
the ClipMark formed by a mark indicating a characteristic picture extracted from an input AV stream is generated as 
management information for supervising the AV stream, whilst the PlayListMark, formed by marks pointing to the picture 
optionally specified by the user, is generated from the playback domain corresponding to the PlayList defining the 
combination of preset domains in the AV stream. The ClipMark and the PlayListMark are recorded as respective inde- 

20 pendent tables on the recording medium. 

[0032] With the information processing method and apparatus and the program, according to the present invention, 
the management information forsupervising the AV stream include ClipMark formed by mark pointing to a characteristic 
picture extracted from an AV stream, PlayListMark formed by a mark pointing to a picture optionally specified by a user 
are read out from a reproduction domain corresponding to the PlayList defining the combination of preset domains in 

25 the AV stream, the management information read out and the information by the PlayListMark are presented, the 
ClipMark corresponding to the PlayList commanded to be reproduced by the user is referenced from the so presented 
information, and the AV stream including the ClipMark is reproduced from a position corresponding to the ClipMark. 
[0033] Other objects, features and advantages of the present invention will become more apparent from reading the 
embodiments of the present invention as shown in the drawings. 

30 

Brief Description of the Drawings 
[0034] 

35 Fig. 1 shows a configuration of an embodiment of a recording and/or reproducing apparatus according to the present 

invention. 

Fig. 2 illustrates the data format of data recorded on a recording medium by a recording and/or reproducing appa- 
ratus 1 . 

Fig.3 illustrates Real PlayList and Virtual PlayList. 
40 Figs.4A, 4B and 4C illustrate the creation of the Real PlayList. 

Figs.5A, 5B and 5C illustrate deletion of the Real PlayList. 

Figs.6A and 6B illustrate assemble editing. 

Fig. 7 illustrates provision of a sub path in the Virtual PlayList. 

Fig. 8 illustrates the changing of the playback sequence of the PlayList. 
45 Fig. 9 illustrates a mark on the PlayList and a mark on the Clip. 

Fig.1 0 illustrates a menu thumbnail. 

Fig.11 illustrates mark added to the PlayList. 

Fig.1 2 illustrates a mark added to the Clip. 

Fig. 13 illustrates the relation between the PlayList, Clip and the thumbnail file. 
50 Fig.1 4 illustrates a directory structure. 

Fig. 15 illustrates a syntax of infr.dvr. 

Fig.1 6 shows a syntax of DVRVolume. 

Fig.1 7 shows a syntax of ResumeVolume. 

Fig.1 8 shows a syntax of U I Applnfo Volume. 
55 Fig.1 9 shows a table of character set values. 

Fig.20 shows a syntax of TableOf PlayList. 

Fig.21 shows another syntax of TableOf PlayList. 

Fig. 22 shows a syntax of the Makers PrivateData. 
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Fig. 23 shows a syntax of xxxx.rpls and yyyy.vpls. 

Figs.24Ato 24C Illustrate the PlayList. 

Fig. 25 shows a syntax of PlayList. 

Fig. 26 shows a table of PlayList_type. 

Fig. 27 shows a syntax of UlApp Info Play List. 

Figs.28Ato 28C illustrate flags in the UlApplnfoPlayList syntax shown in Fig.27. 

Fig. 29 illustrates a Play Item. 

Fig. 30 illustrates a Playltem. 

Fig. 31 illustrates a Playltem. 

Fig.32 shows a syntax of the Playltem. 

Fig.33 illustrates IN-time. 

Fig. 3 4 illustrates OUT-time. 

Fig.35 shows a table of Connection_Condition. 

Figs.36A to 36D illustrate Connection_Condition. 

Fig.37 illustrates BridgeSequencelnfo. 

Fig.38 shows a syntax of BridgeSequencelnfo. 

Fig.39 illustrates SubPlayltem. 

Fig.40 shows a syntax of SubPlayltem. 

Fig.41 shows a table of Markjype. 

Fig.42 shows a syntax of PlayListMark. 

Fig. 43 shows a table of Mark_type. 

Fig. 44 illustrates Mark_time_stamp. 

Fig.45 shows a syntax of zzzzz.clip. 

Fig.46 shows a syntax of Cliplnfo. 

Fig.47 shows a table of Clip_stream_type. 

Fig.48 illustrates offset_SPN. 

Fig.49 illustrates offset_SPN. 

Figs.50A, 50B illustrate the STC domain. 

Fig.51 illustrates STC_lnfo. 

Fig. 52 shows a syntax of STC_lnfo. 

Fig. 53 illustrates Prog ram Info. 

Fig. 54 shows a syntax of Programlnfo. 

Fig. 55 shows a syntax of VideoCondinglnfo. 

Fig. 56 shows a table of Video_format. 

Fig. 5 7 shows a table of frame_rate. 

Fig. 58 shows a table of display_aspect_ratio. 

Fig. 59 shows a syntax of AudioCondinglnfo. 

Fig. 60 shows a table of audio_coding. 

Fig. 61 shows a table of audio_component_type. 

Fig. 62 shows a table of sampling_frequency. 

Fig. 63 illustrates CPI. 

Fig.64 illustrates CPI. 

Fig. 65 shows a syntax of CP!. 

Fig.66 shows a table of CPLtype. 

Fig. 6 7 illustrates video EP_map. 

Fig. 68 illustrates EP_map. 

Fig.69 illustrates EP_map. 

Fig.70 shows a syntax of EP_map. 

Fig. 71 shows a table of EP_typevalues. 

Fig.72 shows a syntax of EP_map_for_one_stream_PID. 

Fig. 73 illustrates TU_map. 

Fig. 74 shows a syntax of TUjnap. 

Fig.75 shows a syntax of ClipMark. 

Fig. 76 shows a table of Mark_type. 

Fig.77 shows a table of Mark_type_stamp. 

Fig. 78 shows another syntax of ClipMark. 

Fig.79 shows another example of a table of Mark_type. 

Fig.80 shows examples of mark_entry() and representative_picture_entry(). 
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Fig. 81 shows a syntax of mark_entry() and representative_picture_entry(). 

Fig. 82 shows another example of the syntax of mark_entry() and representative_picture_entry(). 

Fig.83 shows the relation between RSPN_ref_EP_start and offset_num_pictures. 

Fig. 84 shows another example of the syntax of mark_entry() and representative_picture_entry(). 

Fig.85 shows the relation between ClipMark and EP_map. 

Fig. 86 shows a syntax of menu.thmb and mark.thmb. 

Fig. 87 shows a syntax of Thumbnail. 

Fig. 88 shows a table of thumbnail_pictu reformat. 

Figs.89A and 89B illustrate tn_block. 

Fig. 90 illustrates the structure of a transport stream of DVMPEG2. 

Fig. 91 shows a recorder model of the transport stream of DVMPEG2. 

Fig.92 shows a player model of the transport stream of DVMPEG2. 

Fig. 93 shows a syntax of source packet. 

Fig. 94 shows a syntax of TP_extra_header. 

Fig. 95 shows a table of copy permission indicator. 

Fig. 96 illustrates seamless connection. 

Fig. 97 illustrates seamless connection. 

Fig. 98 illustrates seamless connection. 

Fig. 99 illustrates seamless connection. 

Fig. 100 illustrates seamless connection. 

Fig. 101 illustrates audio overlap. 

Fig. 102 illustrates seamless connection employing BridgeSequence. 
Fig. 103 illustrates seamless connection not employing BridgeSequence. 
Fig.1 04 shows a DVRSTD model. 
Fig.1 05 shows a timing chart for decoding and display. 

Fig.1 06 is a flowchart for illustrating indexing reproduction of a scene indicated by a mark point in the case of the 
syntax of Fig. 81 . 

Fig.1 07 illustrates the operation of reproduction in case the mark_entry()/representative_picture_entry() of Clip- 
Mark in Fig. 75 or Fig. 78 is the syntax of Fig. 81 . 
Fig.1 08 shows an example of EP_mark. 
Fig.1 09 shows an example of ClipMark. 

Fig.1 10 is a flowchart for illustrating the CM skipping reproducing processing in case the mark_entry()/ 
representative_picture_entry() of ClipMark in Fig.75 or Fig. 78 is the syntax of Fig.81 . 

Fig.1 1 1 , continuing to Fig.1 10, is a flowchart for illustrating the CM skipping reproducing processing in case the 

mark_entry()/representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig.81 . 

Fig.1 12 is a flowchart for illustrating the indexing reproduction of a scene indicated by a mark point in case the 

mark_entry()/representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 82. 

Fig.1 13, continuing to Fig.1 12, is a flowchart for illustrating reproduction in case the mark_entry()/ 

representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 82. 

Fig.1 14 shows an example of EP_map. 

Fig.1 15 shows an example of ClipMark. 

Fig.1 16 is a flowchart for illustrating the CM skipping reproducing processing in case the mark_entry()/ 
representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 82. 

Fig.1 17, continuing to Fig. 11 6, is a flowchart for illustrating the CM skipping reproducing processing in case the 

mark_entry()/representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 82. 

Fig.1 18 is a flowchart for illustrating the indexing reproduction of a scene indicated by a mark point in case the 

mark_entry()/representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 84. 

Fig.1 19, continuing to Fig.1 18, is a flowchart for illustrating reproduction in case the mark_entry()/ 

representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 84. 

Fig. 120 shows an example of EP_map. 

Fig. 121 shows an example of ClipMark. 

Fig.1 22 is a flowchart for illustrating the CM skipping reproducing processing in case the mark_entry()/ 
representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 84. 

Fig.1 23, continuing to Fig.1 22, is a flowchart for illustrating the CM skipping reproducing processing in case the 

mark_entry()/representative_picture_entry() of ClipMark in Fig.75 or Fig.78 is the syntax of Fig. 84. 

Fig.1 24 shows an application format. 

Fig.1 25 illustrates a mark on PlayList and a mark on Clip. 

Fig.1 26 shows another example of the ClipMark syntax. 
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Fig. 127 shows still another example of the ClipMark syntax. 

Fig. 128 is a flowchart for illustrating the preparation of ClipMark in case analog AV signals are encoded and re- 
corded. 

Fig. 129 is a flowchart for illustrating the preparation of ClipMark in case a transport stream is to be recorded. 
5 Fig. 130 is a flowchart for illustrating the preparation of RealPlayList. 

Fig. 131 is a flowchart for illustrating the preparation of VirtualPlayList. 

Fig. 132 is a flowchart for illustrating the reproduction of PlayList. 

Fig. 133 is a flowchart for illustrating the preparation of PlayListMark. 

Fig. 134 is a flowchart for illustrating the locating reproduction for reproducing PlayList. 
io Fig. 1 35 shows the syntax of PlayListMark. 

Fig. 136 illustrates Mark_type of PlayListMark, 

Fig. 137 shows another syntax of ClipMark. 

Fig. 138 illustrates mark_type of ClipMark. 

Fig. 139 illustrates a medium. 



15 



Best Mode for Carrying out the Invention 



[0035] Referring to the drawings, present embodiment of the present invention will be explained in detail. Fig.1 shows 
a typical inner structure of a recording and/or reproducing apparatus 1 embodying the present invention. First, the 
20 structure of a recording unit 2, configured for recording signals input from outside, is explained. The recording and/or 
reproducing apparatus 1 is configured for being fed with and recording analog or digital data. 

[0036] Analog video signals and analog audio signals are fed to terminals 11 , 12, respectively. The video signals, 
input to the terminal 11 , are output to an analysis unit 14 and to an AV encoder 15. The audio signals, input to the 
terminal 12, are output to the analysis unit 14 and to the AV encoder 15. The analysis unit 14 extracts feature points, 

25 such as scene changes, from the input video and audio signals. 

[0037] The AV encoder 15 encodes input video and audio signal to output the system information (S), such as an 
encoded video stream (V), an encoded audio stream (A) and AV synchronization, to a multiplexer 16. 
[0038] The encoded video stream is a video stream encoded e.g., with the MPEG (Moving Picture Expert Group) 2 
system, whilst the encoded audio stream is an audio stream encoded in accordance with the MPEG1 system, with the 

30 encoded audio stream being e.g., an audio stream encoded in e.g., the MPEG1 system or an audio stream encoded 
in accordance with the Dolby AC3 (trademak) system. The multiplexer 1 6 multiplexes the input video and audio streams, 
based on the input system information, to output a multiplexed stream through a switch 17 to a multiplexed stream 
analysis unit 18 and to a source packetizer 19. 

[0039] The multiplexed stream is e.g., an MPEG-2 transport stream or an MPEG2 program stream. The source 
35 packetizer 1 9 encodes the input multiplexed stream into an AV stream composed of source packets in accordance 
with an application format of a recording medium 100 on which to record the stream. The AV stream is processed in 
ECC (error correction and coding) unit 20 and a modulation unit 21 with appendage of ECC codes and with modulation, 
before being output to a write unit 22, which then writes (records) an AV stream file based on a control signals output 
by the controller 23. 

[0040] The transport stream, such as digital television broadcast, input from a digital interface or a digital television 
tuner, is input to a terminal 1 3. There are two recording systems for recording the transport stream input to the terminal 
13, one being a transparent recording system and the other being a system in which recording is preceded by re- 
encoding aimed to lower e.g., the recording bit rate. The recording system command information is input from a terminal 
24 as a user interface to a controller 23. 
45 [0041] In the transparent recording of the input transport stream, a transport stream, input to a terminal 13, is output 
through a switch 1 7 to a multiplexed stream analysis unit 1 8 and to the source packetizer 19. The ensuing processing 
of recording an AV stream on a recording medium is the same as that of encoding and recording analog input audio 
and video signals, as described above, and hence is not explained here for simplicity. 

[0042] If the input transport stream is re-encoded and subsequently recorded, the transport stream, input to the 
50 terminal 1 3, is fed to a demultiplexer 26, which demultiplexes the input transport stream to extract a video stream (V), 
an audio stream (A) and the system information (S). 

[0043] Of the stream (information), as extracted by the demultiplexer 26, the video stream is output to an audio 
decoder 27, whilst the audio stream and the system information are output to the multiplexer 16. The audio decoder 
27 decodes the input transport stream to output the encoded video stream (V) to the multiplexer 16. 
55 [0044] The audio stream and the system information, output from the demultiplexer 26 and input to the multiplexer 
16, and the video stream, output by the AV encoder 15, are multiplexed, based on the input system information, and 
output to the multiplexed stream analysis unit 1 8 and to the source packetizer 1 9 through switch 1 7, as a multiplexed 
stream. The ensuing processing of recording an AV stream on a recording medium is the same as that of encoding 



7 



EP 1 280 348 A1 



and recording analog input audio and video signals, as described above, and hence is not explained here for simplicity. 
[0045] The recording and/or reproducing apparatus 1 of the present embodiment records a file of the AV stream on 
the recording medium 100, while also recording the application database information which accounts for the file. The 
input information to the controller 23 is the feature information for the moving picture from the analysis unit 14, the 
5 feature information of the AV stream from the multiplexed stream analysis unit 1 8 and the user command information 
input at a terminal 24. 

[0046] The feature information of the moving picture, supplied from the analysis unit 1 4, is generated by the analysis 
unit 14 when the AV encoder 15 encodes video signals. The analysis unit 14 analyzes the contents of the input video 
and audio signals to generate the information pertinent to the pictures characteristic of the input moving picture signals 
10 (clip mark). This information is the information indicating a picture of characteristic clip mark points, such as program 
start points, scene change points, CM commercial start and end points, title or telop in input video signals, and also 
includes a thumbnail of the picture and the information pertinentto stereo/monaural switching points and muted portions 
of audio signals. 

[0047] The above picture indicating information is fed through controller 23 to the multiplexer 1 6. When multiplexing 
15 a encoded picture specified as clip mark by the controller 23, the multiplexer 1 6 returns the information for specifying 
the encoded picture on the AV stream to the controller 23. Specifically, this information is the PTS (presentation time 
stamp) of a picture or the address information on the AV stream of an encoded version of the picture. The controller 
23 stores the sort of feature pictures and the information for specifying the encoded picture on the AV stream in asso- 
ciation with each other. 

20 [0048] The feature information of the AV stream from the multiplexed stream analysis unit 18 is the information 
pertinent to the encoding information of the AV stream to be recorded, and is recorded by an analysis unit 18. For 
example, the feature information includes the time stamp and address information of the l-picture in the AV stream, 
discontinuous point information of system time clocks, encoding parameters of the AV stream and change point infor- 
mation of the encoding parameters in the AV stream. When transparently recording the transport stream, input from 

25 the terminal 13, the multiplexed stream analysis unit 18 detects the picture of the aforementioned clip mark, from the 
input transport stream, and generates the information for specifying a picture designated by the clip mark and its type. 
[0049] The user designation information from the terminal 24 is the information specifying the playback domain, 
designated by the user, character letters for explaining the contents of the playback domain, or the information such 
as bookmarks or resuming points set by the user for his or her favorite scene. 

30 [0050] Based on the aforementioned input information, the controller 23 creates a database of the AV stream (Clip), 
a database of a group (PlayList) of playback domains (Playltem) of the AV stream, management information of the 
recorded contents of the recording medium 100 (info.dvr) and the information on thumbnail pictures. Similarly to the 
AV stream, the application database information, constructed from the above information, is processed in the ECC unit 
20 and the modulation unit 21 and input to the write unit 22, which then records a database file on the recording medium 

35 100. 

[0051] The above-described application database information will be explained subsequently in detail. 
[0052] When the AV stream file recorded on the recording medium 100 (files of picture data and speech data) and 
the application database information, thus recorded on the recording medium 100, are reproduced by a reproducing 
unit 3, the controller 23 first commands a readout unit 28 to read out the application database information from the 
40 recording medium 1 00. The readout unit 28 reads out the application database information from the recording medium 
1 00, which then reads out the application database information from the recording medium 1 00 to send the application 
database information through demodulation and error correction processing by a demodulating unit 29 and an ECC 
decoder 30 to the controller 23. 

[0053] Based on the application database information, the controller 23 outputs a list of PlayList recorded on the 
45 recording medium 100 to a user interface of the terminal 24. The user selects the PlayList, desired to be reproduced, 

from the list of PlayLists. The information pertinent to PlayList, specified to be reproduced, is input to the controller 23. 

The controller 23 commands the readout unit 28 to read out the AV stream file necessary in reproducing the PlayList. 

In accordance with the command, the readout unit 28 reads out the corresponding AV stream from the recording 

medium 1 00 to output the read-out AV stream to the demodulating unit 29. The AV stream, thus input to the demodu- 
50 lating unit 29, is demodulated by preset processing and output through the processing by the ECC decoder 30 to a 

source depacketizer 31 . 

[0054] The source depacketizer 31 converts the AV stream of the application format, read out from the recording 
medium 1 00 and processed in a preset fashion, into a stream processable by the demultiplexer 26. The demultiplexer 
26 outputs the system information (S), such as the video stream (V), audio stream (A) or the AV synchronization, 
55 forming the playback domain (Playltem) of the AV stream specified by the controller 23, to the audio decoder 27, which 
AV decoder 27 decodes the video stream and the audio stream to output the playback video signal and the playback 
audio signal to associated terminals 32, 33, respectively. 

[0055] If fed from the terminal 24, as a user interface, with the information instructing random access playback or 
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special playback, the controller 23 determines the readout position of the AV stream from the recording medium 1 00, 
based on the contents of the database (Clip) of the AV stream, to command the readout unit 28 to read out the AV 
stream. If the PlayList as selected by the user is to be reproduced as from a preset time point, the controller 23 com- 
mands the readout unit 28 to read out data from an l-picture having a time stamp closest to the specified time point. 

5 [0056] When the user has selected a certain clip mark from indexing points or scene change points for the program 
stored in the ClipMark in the Clip Information, as_when the user selects a certain picture from a list of thumbnail pictures, 
as demonstrated on a user interface, of the indexing points or scene change points stored in the ClipMark, the controller 
23 determines the AV stream readout position from the recording medium 1 00 to command the readout unit 28 to read 
out the AV stream. That is, the controller 23 commands the readout unit 28 to read out data from an l-picture having 

10 an address closest to the address on the AV stream which has stored the picture selected by the user. The readout 
unit 28 reads out data from the specified address . The read-out data is processed by the demodulating unit 29, ECC 
decoder 30 and by the source packetizer 19 so as to be supplied to the demultiplexer 26 and decoded by the audio 
decoder 27 to reproduce AV data indicated by an address of the mark point picture. 

[0057] If the user has commanded fast forward playback, the controller 23 commands the readout unit 28 to sequen- 
is tially read out l-picture data in the AV stream in succession based on the database (Clip) of the AV stream. 

[0058] The readout unit 28 reads out data of the AV stream from a specified random access point. The so read-out 
data is reproduced through processing by various components on the downstream side. 

[0059] The case in which the user edits the AV stream recorded on the recording medium 1 00 is now explained. If 
desired to specify a playback domain for the AV stream recorded on the recording medium 100, for example, if desired 
20 to create a playback route of reproducing a portion sung by a singer A from a song program A, and subsequently 
reproducing a portion sung by the same singer A from another song program B, the information pertinent to a beginning 
point (IN-point) and an end point (OUT-point) of the playback domain is input to the controller 23 from the terminal as 
a user interface. The controller 23 creates a database of the group (PlayList) of playback domains (Play Item) of the 
AV streams. 

25 [0060] When the user desires to erase a portion of the AV stream recorded on the recording medium 1 00, the infor- 
mation pertinent to the IN-point and the OUT-point of the erasure domain is input to the controller 23, which then 
modifies the database of the PlayList so as to refer to only the needed AV streams. The controller 23 also commands 
the write unit 22 to erase an unneeded stream portion of the AV stream. 

[0061] The case in which the user desires to specify playback domains of an AV stream recorded on the recording 
30 medium to create a new playback route, and to interconnect the respective playback domains in a seamless fashion, 
is now explained. In such case, the controller 23 creates a database of a group (PlayList) of the playback domains 
(Playltem) of the AV stream and undertakes to partially re-encode and re-multiplex the video stream in the vicinity of 
junction points of the playback domains. 

[0062] The picture information at the IN-point and that at the OUT-point of a playback domain are input from a terminal 
35 24 to a controller 23. The controller 23 commands the readout unit 28 to read out data needed to reproduce the pictures 
at the IN-point and at the OUT-point. The readout unit 28 reads out data from the recording medium 1 00. The data so 
read out is output through the demodulating unit 29, ECC decoder 30 and the source packetizer 1 9 to the demultiplexer 
26. 

[0063] The controller 23 analyzes data input to the demultiplexer 26 to determine the re-encoding method for the 

40 video stream (change of picture_coding type and assignment of the quantity of encoding bits for re-encoding) and 

the re-multiplexing system to send the system to the AV encoder 15 and to the multiplexer 1 6. 

[0064] The demultiplexer 26 then separates the input stream into the video stream (V), audio stream (A) and the 
system information (S). The video stream may be classed into data input to the audio decoder 27 and data input to 
the multiplexer 16. The former is data needed for re-encoding, and is decoded by the audio decoder 27, with the 
45 decoded picture being then re-encoded by the AV encoder 15 and thereby caused to become a video stream. The 
latter data is data copied from an original stream without re-encoding. The audio stream and the system information 
are directly input to the multiplexer 1 6. 

[0065] The multiplexer 1 6 multiplexes an input stream, based on the information input from the controller 23, to output 
a multiplexed stream, which is processed by the ECC unit 20 and the modulation unit 21 so as to be sent to the write 
50 unit 22. The write unit 22 records an AV stream on the recording medium 100 based on the control signals supplied 
from the controller 23. 

[0066] The application database information and the operation based on this information, such as playback and 
editing, are hereinafter explained. Fig. 2 shows the structure of an application format having two layers, that is PlayList 
and Clip, for AV stream management. The Volume Information manages ail Clips and PlayLists in the disc. Here, one 
55 AV stream and the ancillary information thereof, paired together, is deemed to be an object, and is termed Clip. The 
AV stream file is termed a Clip AV stream file, with the ancillary information being termed the Clip Information file. 
[0067] One Clip AV stream file stores data corresponding to an MPEG-2 transport stream arranged in a structure 
prescribed by the application format. By and large, a file is treated as a byte string. The contents of the Clip AV stream 
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file are expanded on the time axis, with entry points in the Clip (l-picture) being mainly specified on the time basis. 
When a time stamp of an access point to a preset Clip is given, the Clip Information file is useful in finding the address 
information at which to start data readout in the Clip AV stream file. 

[0068] Referring to Fig .3, PlayList is now explained, which is provided for a user to select a playback domain desired 
5 to be viewed from the Clip and to edit the playback domain readily. One PlayList is a set of playback domains in the 
Clip. One playback domain in a preset Clip is termed Playltem and is represented by a pair of an IN-point and an OUT- 
point on the time axis. So, the PlayList is formed by a set of plural Playltems. 

[0069] The PlayList is classified into two types, one of which is Real PlayList and the other of which is Virtual PlayList. 
The Real PlayList co-owns stream portions of the Clip it is referencing. That is, the Real PlayList takes up in the disc 
10 the data capacity corresponding to a stream portion of the Clip it is referencing and, when Real PlayList is erased, the 
data of the stream portion of the Clip it is referencing is also erased. 

[0070] The Virtual PlayList is not co-owning Clip data. Therefore, if the Virtual PlayList is changed or erased, the 
contents of the Clip are in no way changed. 

[0071] The editing of the Real Playlist is explained. Fig.4A shows creation of Real PlayList and, if the AV stream is 
15 recorded as a new Clip, the Real PlayList which references the entire Clip is a newly created operation. 

[0072] Fig.4B shows the division of the real PlayList, that is the operation of dividing the Real PlayList at a desired 
point to split the Real PlayList in two Real PlayLists. This division operation is performed when two programs are 
managed in one clip managed by a sole PlayList and when the user intends to re-register or re-record the programs 
as separate individual programs. This operation does not lead to alteration of the Clip contents, that is to division of 
20 the Clip itself. 

[0073] Fig.4C shows the combining operation of the Real PlayList which is the operation of combining two Real 
PlayLists into one new Real PlayList. This combining operation is performed such as when the user desires to re- 
register two programs as a sole program. This operation does not lead to alteration of the Clip contents, that is to 
combining the clip itself into one. 
25 [0074] Fig.5A shows deletion of the entire Real PlayList. If the operation of erasing the entire preset Real PlayList, 
the associated stream portion of the Clip referenced by the deleted Real PlayList is also deleted. 
[0075] Fig.5B shows partial deletion of the Real PlayList. If a desired portion of the Real PlayList is deleted, the 
associated Playltem is altered to reference only the needed Clip stream portion. The corresponding stream portion of 
the Clip is deleted. 

30 [0076] Fig.5C shows the minimizing of the Real PlayList. It is an operation of causing the Playltem associated with 
the Real PlayList to reference only the stream portion of the Clip needed for Virtual PlayList. The corresponding stream 
portion of the Clip not needed for the Virtual PlayList is deleted. 

[0077] If the Real PlayList is changed by the above-described operation such that the stream portion of the Clip 
referenced by the Real PlayList is deleted, there is a possibility that the Virtual PlayList employing the deleted Clip is 

35 present such that problems may be produced in the Virtual PlayList due to the deleted Clip. 

[0078] In order to prevent this from occurring, such a message which runs: "If there exists the Virtual PlayList refer- 
encing the stream portion of the Clip the Real PlayList is referencing, and the Real PlayList is deleted, the Virtual 
PlayList itself is deleted - is it all right?" is displayed for the user in response to the user's operation of deletion by way 
of confirmation or alarming, after which the processing for deletion is executed or cancel led subject to user's commands. 

40 Alternatively, the minimizing operation for the Real PlayList is performed in place of deleting the Virtual PlayList. 

[0079] The operation for the Virtual PlayList is now explained. If an operation is performed for the Virtual PlayList, 
the contents of the Clip are not changed. Figs.6A and 6B show the assembling and editing (IN-OUT editing). It is an 
operation of creating Playltem of the playback domain the user has desired to view to create Virtual PlayList. The 
seamless connection between Playltems is supported by the application format, as later explained. 

45 [0080] If there exist two Real PlayLists 1 , 2 and clips 1 , 2 associated with the respective Real PlayLists, the user 
specifies a preset domain in the Real PlayList 1 (domain from IN1 to OUT1 : Playltem 1) as the playback domain, and 
also specifies, as the domain to be reproduced next, a preset domain in the Real PlayList 2 (domain from IN2 to OUT2: 
Playltem 2) as the playback domain, as shown in Fig. 6A, a sole Virtual PlayList made up of Playltem 1 and the Playltem2 
is prepared, as shown in Fig.6B. 

so [0081] The re-editing of the Virtual PlayList is now explained. The re-editing may be enumerated by alteration of IN- 
or OUT points in the Virtual PlayList, insertion or appendage of new Playltems to the Virtual PlayList and deletion of 
Playltems in the Virtual PlayList. The Virtual PlayList itself may also be deleted. 

[0082] Fig. 7 shows the audio dubbing (post recording) to the Virtual PlayList. It is an operation of registering the 
audio post recording to the Virtual PlayList as a sub path. This audio post recording is supported by the application 
55 software. An additional audio stream is added as a sub path to the AV stream of the main path of the Virtual PlayList. 
[0083] Common to the Real PlayList and the Virtual PlayList is an operation of changing (moving) the playback 
sequence of the PlayList shown in Fig. 8. This operation is an alteration of the playback sequence of the PlayList in the 
disc (volume) and is supported by TableOfPlayList as defined in the application format, as will be explained subse- 
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quently with reference to e.g., Fig.20. This operation does not lead to alteration of the Clip contents. 
[0084] The mark (Mark) is now explained. The mark is provided for specifying a highlight or characteristic time in the 
Clip and in the PlayList, as shown in Fig. 9. The mark added to the Clip is termed the ClipMark. The ClipMark is e.g., 
a program indexing point or a scene change point for specifying a characteristic scene ascribable to contents in the 
AV stream. The ClipMark is generated by e.g., the analysis unit 1 4 of Fig.1 . When the PlayList is reproduced, the mark 
of the Clip referenced by the PlayList may be referenced and used. 

[0085] The mark appended to the PlayList is termed the PlayListMark (play list mark). The PlayListMark is e.g., a 
bookmark point or a resuming point as set by the user. The setting of the mark to the Clip and to the PlayList is by 
adding a time stamp indicating the mark time point to the mark list. On the other hand, mark deletion is removing the 
time stamp of the mark from the mark list. Consequently, the AV stream is in no way changed by mark setting or by 
mark deletion. 

[0086] As another format of the ClipMark, a picture referenced by the ClipMark may be specified on the address 
basis in the AV stream. Mark setting on the Clip is by adding the address basis information indicating the picture of the 
mark point to the mark list. On the other hand, mark deletion is removing the address basis information indicating the 
mark point picture from the mark list. Consequently, the AV stream is in no way changed by mark setting or by mark 
deletion. 

[0087] A thumbnail is now explained. The thumbnail is a still picture added to the Volume, PlayList and Clip. There 
are two sorts of the thumbnail, one of them being a thumbnail as a representative picture indicating the contents. This 
is mainly used in a main picture in order for the user to select what the or she desired to view on acting on a cursor, 
not shown. Another sort of the thumbnail is a picture indicating a scene pointed by the mark. 

[0088] The Volume and the respective PlayLists need to own representative pictures. The representative pictures 
of the Volume are presupposed to be used for initially demonstrating a still picture representing the disc contents when 
the disc is set in position in the recording and/or reproducing apparatus 1 . It is noted that the disc means the recording 
medium 1 00 which is presupposed to be a of disc shape. The representative picture of the PlayList is presupposed to 
be used as a still picture for representing PlayList contents. 

[0089] As the representative picture of the PlayList, it may be contemplated to use the initial picture of the PlayList 
as the thumbnail (representative picture). However, the leading picture at the playback time of 0 is not necessarily an 
optimum picture representing the contents. So, the user is allowed to set an optional picture as a thumbnail of the 
PlayList. Two sorts of the thumbnails, that is the thumbnail as a representative picture indicating the Volume and the 
thumbnail as a representative picture indicating PlayList, are termed menu thumbnails. Since the menu thumbnails 
are demonstrated frequently, these thumbnails need to be read out at an elevated speed from the disc. Thus, it is 
efficient to store the totality of the menu thumbnails in a sole file. It is unnecessary for the menu thumbnails to be 
pictures extracted from the moving pictures in the volume, but may be a picture captured from a personal computer or 
a digital still camera, as shown in Fig. 10. 

[0090] On the other hand, the Clip and the PlayList need be marked with plural marks, whilst the pictures of the mark 
points need to be readily viewed in order to grasp the contents of the mark positions. The picture indicating such mark 
point is termed a mark thumbnail. Therefore, the picture which is the original of the mark thumbnail is mainly an extracted 
mark point picture rather than a picture captured from outside. 

[0091] Fig. 11 shows the relation between the mark affixed to the PlayList and the mark thumbnail, whilst Fig. 12 
shows the relation between the mark affixed to the Clip and the mark thumbnail. In distinction from the menu thumbnail, 
the mark thumbnail is used in e.g., a sub-menu for representing details of the PlayList, while it is not requested to be 
read out in a short access time. So, whenever a thumbnail is required, the recording and/or reproducing apparatus 1 
opens a file and reads out a portion of the file, while there is no problem presented even if file opening and reading 
out a portion of the file by the recording and/or reproducing apparatus 1 takes some time. 

[0092] For decreasing the number of files present in a volume, it is preferred for the totality of the mark thumbnails 
to be stored in one file. While the PlayList may own one menu thumbnail and plural mark thumbnails, the user is not 
required to select the Clip directly (usually, the Clip is selected through PlayList), and hence there is no necessity of 
providing menu thumbnails. 

[0093] Fig. 13 shows the relation between the menu thumbnails, mark thumbnails, PlayList and Clips. In the menu 
thumbnail file are filed menu thumbnails provided from one PlayList to another. In the menu thumbnail file is contained 
a volume thumbnail representing the contents of data recorded on the disc. In the menu thumbnail file are filed thumb- 
nails created from one PlayList to another and from one Clip to another. 

[0094] The CPI (Characteristic Point Information) is hereinafter explained. The CPI is data contained in the Clip 
information file and is used mainly for finding a data address in the Clip AV stream file at which to start the data readout 
when a time stamp of the access point to the Clip is afforded. In the present embodiment two sorts of the CPI are used, 
one of them being EP_map and the other being TLLmap. 

[0095] The EP_map is a list of entry point (EP) data extracted from the elementary stream and the transport stream. 
This has the address information used to find the site of entry points in the AV stream at which to start the decoding. 
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One EP data is made up of a presentation time stamp (PTS) and a data address in the AV stream of the accessing 
unit associated with the PTS, with the data address being paired to the PTS. 

[0096] The EP_map is used mainly for two purposes. First, it is used for finding a data address in the AV stream in 
the accessing unit referenced by the PTS in the PlayList. Second, the EP_map is used for fast forward playback or 
5 fast reverse playback. If, in recording the input AV stream by the recording and/or reproducing apparatus 1 , the syntax 
of the stream can be analyzed, the EP_map is created and recorded on the disc. 

[0097] The TU_map has a list of time unit (TU) data which is derived from the arrival time point of the transport packet 
input through a digital interface. This affords the relation between the arrival-time-based time and the data address in 
the AV stream. When the recording and/or reproducing apparatus 1 records an input AV stream, and the syntax of the 
10 stream cannot be analyzed, a TU_map is created and recorded on the disc. 

[0098] The STCInfo stores the discontinuous point information in the AV stream file which stores the MPEG-2 trans- 
port stream. 

[0099] When the AV stream has discontinuous points of STC, the same PTS values may appear in the AV stream 
file. Thus, if a time point in the AV stream is specified on the PTS basis, the PTS pf the access point is insufficient to 
15 specify the point. Moreover, an index of the continuous STC domain containing the PTS is required. In this format, the 
continuous STC domain and its index are termed an STC-sequence and STC-sequence-id, respectively. The STC- 
sequence information is defined by the STCInfo of the Clip Information file. 

[0100] The STC-sequence-id is used in an AV stream file and is optional in the AV stream file having the TU_map. 
[0101] The programs are each a collection of elementary streams and co-own a sole system time base for synchro- 

20 nized reproduction of these streams. 

[0102] It is useful for a reproducing apparatus (recording and/orreproducing apparatus 1 of Fig.1) to know the con- 
tents of an AV stream prior to its decoding. These contents include e.g., values of the PID of a transport packet trans- 
mitting an audio or video elementary stream or the type of the video or audio components, such as HDTV video or 
MPEG-2 AAC audio stream. This information is useful for creating a menu screen toy illustrating to the user the contents 

25 of the PlayList referencing the AV stream. It is similarly useful for setting the initial state of the AV decoder and the 
demultiplexer of the respective apparatus. 

[0103] For this reason, the Clip Information file owns Programlnfo for illustrating the program contents. 
[01 04] It may be an occurrence that program contents be changed in the AV stream file in which the M PEG-2 transport 
stream is stored. For example, the PID of the transport packet transmitting the video elementary stream may be 
30 changed, or the component type of the video stream may be changed from SDTV to HDTV. 

[0105] The Programlnfo stores the information on change points of program contents in the AV stream file. The 
domain of the AV stream file in which the program contents remain constant is termed program-sequence. 
[0106] This program-sequence is used in an AV stream file having EP_map and is optional in an AV stream file 
having TU_map. 

35 [0107] The present embodiment defines the self-encoding stream format (SESF). The SESF is used for encoding 
analog input signals and for decoding digital input signals for subsequently encoding the decoded signal into an MPEG- 
2 transport stream. 

[0108] The SESF defines an elementary stream pertinent to the MPEG-2 transport stream and the AV stream. When 
the recording and/or reproducing apparatus 1 encodes and records input signals in the SESF, an EP_map is created 
40 and recorded on the disc. 

[0109] A digital broadcast stream uses one of the following systems for recording on the recording medium 100: 
First, the digital broadcast stream is transcoded into an SESF stream. In this case, the recorded stream must conform 
to SESF and EP_map must be prepared and recorded on the disc. 

[0110] Alternatively, an elementary stream forming a digital broadcast stream is transcoded to a new elementary 
45 stream and re-multiplexed to a new transport stream conforming to the stream format prescribed by the organization 
for standardizing the digital broadcast stream. In this case, an EP_map must be created and recorded on the disc. 
[0111] For example, it is assumed that the input stream is an MPEG-2 transport stream conforming to the ISDB 
(standard appellation of digital BS of Japan), with the transport stream containing the HDTV video stream and the 
MPEG AAC audio stream. The HDTV video stream is transcoded to an SDTV video stream, which SDTV video stream 
50 and the original AAC audio stream are re-multiplexed to TS. The SDTV stream and the transport stream both need to 
conform to the ISDB format. 

[01 12] Another system of recording the digital broadcast stream on the recording medium 1 00 is to make transparent 
recording of the input transport stream, that is to record the input transport stream unchanged, in which case the 
EP_map is formulated and recorded on the disc. 
55 [01 13] Alternatively, the input transport stream is recorded transparently, that is an input transport stream is recorded 
unchanged, in which case TU_map is created and recorded on the disc. 

[0114] The directory and the file are hereinafter explained. The recording and/or reproducing apparatus 1 is herein- 
after described as DVR (digital video recording). Fig. 14 shows a typical directory structure on the disc. The directories 
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of the disc of the DVR may be enumerated by a root directory including "DVR" directory, and the "DVR" directory, 
including "PLAYLIST" directory, "CLIPINF" directory, "M2TS" directory and "DATA" directory, as shown in Fig. 14. Al- 
though directories other than these may be created below the root directory, these are discounted in the application 
format of the present embodiment. 
5 [01 1 5] Below the "DATA" directory, there are stored all files and directories prescribed by the DVR application format. 
The "DVR" directory includes four directories. Below the "PLAYLIST 1 directory are placed Real PlayList and Virtual 
PlayList database files. The latter directory may exist in a state devoid of PlayList. 

[01 1 6] Below "CLI PINF" is placed a Clip database. This directory, too, may exist in a state devoid of AV stream files. 

In the "DATA" directory, there are stored files of data broadcast, such as digital TV broadcast. 
10 [01 17] The "DVR" directory stores the following files: That is, an "info.dvr" is created below the DVR directory to store 

the comprehensive information of an application layer. Below the DVR directory, there must be a sole info.dvr. The 

filename is assumed to be fixed to info.dvr. The "menu.thmb" stores the information pertinent to the menu thumbnails. 

Below the DVR directory, there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". 

If there is no menu thumbnail, this file may not exist. 
15 [0118] The "mark.thmb" file stores the information pertinent to the mark thumbnail picture. Below the DVR directory, 

there must be 0 or 1 mark thumbnail. The filename is assumed to be fixed to "menu.thmb". If there is no menu thumbnail , 

this file may not exist. 

[0119] The "PLAYLIST' directory stores two sorts of the PlayList files which are Real PlayList and Virtual PlayList. 
An "xxxxx.rpls" file stores the information pertinent to one Real PlayList. One file is created for each Real PlayList. The 
20 filename is "xxxxx.rpls", where "xxxxx" denotes five numerical figures from 0 to 9. A file extender must be "rpls". 

[0120] The "yyyyy.vpls" stores the information pertinent to one Virtual PlayList. One file with a filename ,, yyyyy,vpls ,, 
is created from one Virtual PlayList to another, where "yyyyy" denotes five numerical figures from 0 to 9. A file extender 
must be "vpls". 

[0121] The "CLI PINF" directory stores one file in association with each AV stream file. The "zzzzz.clpi" is a Clip 
25 Information file corresponding to one AV stream file (Clip AV stream file or Bridge-Clip stream file). The filename is 

"zzzzz.clpi", where "zzzzz" denotes five numerical figures from 0 to 9. A file extender must be "dpi". 

[0122] The "M2TS" directory stores an AV stream file. The "zzzzz.m2ts" file is an AV stream file handled by the DVR 

system. This is a Clip AV stream file or a Bridge-Clip AV stream file. The filename is "zzzzz. m2ts", where "zzzzz" 

denotes five numerical figures from 0 to 9. A file extender must be "m2ts". 
30 [0123] The "DATA" directory stores data transmitted from data broadcasting. This data may, for example, be XML 

or MPEG files. 

[0124] The syntax and the semantics of each directory (file) are now explained. Fig. 15 shows the syntax of the "info, 
dvr" file. The "info.dvr" file is made up of three objects, that is DVRVoume(), TableOf PlayLists() and Makers PrivateData 
(). 

35 [01 25] The syntax of info.dvr shown in Fig. 1 5 is explained. The TableOf Play Lists_Start_ad dress indicates the leading 
address of the TableOfPlayLists() in terms of the relative number of bytes from the leading byte of the "info.dvr" file. 
The relative number of bytes is counted beginning from 0. 

[0126] The Makers PrivateData_Start_address indicates the leading address of the MakersPrivateData(), in terms 
of the relative number of bytes as from the leading byte of the "info.dvr" file. The relative number of bytes is counted 
40 from 0. The padding_word is inserted in association with the syntax of "info.dvr". N1 and N2 are optional positive 
integers. Each padding word may assume an optional value. 

[0127] The DVRVolume() stores the information stating the contents of the volume (disc). Fig.1 6 shows the syntax 
of the DVRVolume. The syntax of the DVRVolume(), shown in Fig. 16, is now explained. The version_n umber indicates 
four character letters indicting the version numbers of the DVRVolume(). The version_number is encoded to "0045" in 
45 association with IS0646. 

[0128] Length is denoted by 32-bit unsigned integers indicating the number of bytes from directly after the length 
field to the trailing end of DVRVolumeQ. 

[0129] The ResumeVolume() memorizes the filename of the Real PlayList or the Virtual PlayList reproduced last in 
the Volume. However, the playback position when the user has interrupted playback of the Real PlayList or the Virtual 
so PlayList is stored in the resume-mark defined in the PlayListMark() (see Figs.42 and 43). 

[0130] Fig. 17 shows the syntax of the ResumeVolume(). The syntax of the ResumeVolume() shown in Fig. 17 is 
explained. The valid_Jiag indicates that the resume_PlayList_name field is valid or invalid when this 1 -bit flag is set to 
1 or 0, respectively. 

[0131] The 10-byte field of resume_PlayList_name indicates the filename of the Real PlayList or the Virtual PlayList 
55 to be resumed. 

[0132] The UlApplnfoVolume in the syntax of the DVRVolume(), shown in Fig. 16, stores parameters of the user 
interface application concerning the Volume. Fig.1 8 shows the syntax of the U lApplnfo Volume, the semantics of which 
are now explained. The 8-bit field of character_set indicates the encoding method for character letters encoded in the 
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Volume_name field. The encoding method corresponds to the values shown in Fig. 19. 

[0133] The 8-bit field of the namejength indicates the byte length of the Volume name indicated in the Volume_name 
field. The Volume_name field indicates the appellation of the Volume. The number of bytes of the number of the 
namejength counted from left of the field is the number of valid characters and indicates the volume appellation. The 

5 values next following these valid character letters may be any values. 

[0134] The Volume_protect_flag is a flag indicating whether or not the contents in the Volume can be shown to the 
user without limitations. If this flag is set to 1 , the contents of the Volume are allowed to be presented (reproduced) to 
the user only in case the user has succeeded in correctly inputting the PIN number (password). If this flag is set to 0, 
the contents of the Volume are allowed to be presented to the user even in case the PIN number is not input by the user. 

10 [0135] If, when the user has inserted a disc into a player, this flag has been set to 0, or the flag is set to 1 but the 
user has succeeded in correctly inputting the PIN number, the recording and/or reproducing apparatus 1 demonstrates 
a list of the PlayList in the disc. The limitations on reproduction of the respective PlayLists are irrelevant to the 
Volume_protectJlag and are indicated by playback_control_flag defined in the UlApplnfoVolume. 
[0136] The PIN is made up of four numerical figures of from 0 to 9, each of which is coded in accordance with ISO/ 

15 |EC 646. The ref Jhumbnailjndex field indicates the information of a thumbnail picture added to the Volume. If the 
ref_thumbnail_index field is of a value other than OxFFFF, a thumbnail picture is added to the Volume. The thumbnail 
picture is stored in a menu.thumbfile. The picture is referenced using the value of the ref JhumbnaiUndex in the menu, 
thumb file. If the ref_thumbnail_index field is OxF FFF it indicates that a thumbnail picture has been added to the Volume. 
[0137] The TableOfPlayList() in the info.dvr syntax shown in Fig. 15 is explained. The TableOfPlayList() stores the 

20 filename of the PlayList (Real PlayList and Virtual PlayList). All PlayList files recorded in the Volume are contained in 
the TableOfPlayList(), which TableOfPlayList() indicates the playback sequence of the default of the PlayList in the 
Volume. 

[0138] Fig.20 shows the syntax of the TableOfPlayList(), which is now explained. The version_number of the Table- 
OfPlayList() indicates four character letters indicating the version numbers of the TableOfPlayLists. The 

25 version_number must be encoded to "0045" in accordance with ISO 646. 

[0139] Length is a unsigned 32-bit integer indicating the number of bytes of the TableOfPlayList() from directly after 
the length field to the trailing end of the TableOfPlayList(). The 16-bit field of the n urn ber_of_P lay Lists indicates the 
number of loops of the for-loop inclusive of the PlayList_file_name. This numerical figure must be equal to the number 
of PlayLists recorded in the Volume. The 1 0-byte numerical figure of the Play List_file_name indicates the filename of 

30 the PlayLists. 

[0140] Fig.21 shows another configuration of the syntax of the TableOfPlayList(). The syntax shown in Fig.21 is 
comprised of the syntax shown in Fig.20 in which is contained the UlApplnfoPlayList. By such structure including the 
UlApplnfoPlayList, it becomes possible to create a menu picture simply on reading out the TableOfPlayLists. The 
following explanation is premised on the use of the syntax shown in Fig.20. 

35 [0141] The MakersPrivateData in the info.dvr shown in Fig. 15 is explained. The Makers Private Data is provided to 
permit the maker of the recording and/or reproducing apparatus 1 to insert private data of the maker in the MakersPri- 
vateData() for special applications of different companies. The private data of each maker has standardized makerJD 
for identifying the maker who has defined it. The Makers PrivateData() may contain one or more makerJD. 
[0142] If a preset maker intends to insert private data, and the private data of a different maker is already contained 

40 in the MakersPrivateDataQ, the new private data is added to the MakersPrivateData() without erasing the pre-existing 
old private data. Thus, in the present embodiment, private data of plural makers can be contained in one MakersPri- 
vateDataQ. 

[0143] Fig. 22 shows the syntax of the MakersPrivateData. The syntax of the MakersPrivateData shown in Fig.22 is 
explained. The version_number of the TableOf PlayListQ indicates four character letters indicating the version numbers 
■45 of the TableOfPlayLists. The version__n umber must be encoded to ,, 0045 ,, in accordance with ISO 646. Length is a 
unsigned 32-bit integer indicating the number of bytes of the TableOf PlayList() from directly after the length field to the 
trailing end of the Makers PrivateData(). 

[0144] The mpd_blocks_start_address indicates the leading end address of the first mpd_block() in terms of the 
relative number of bytes from the leading byte of the MakersPrivateData(). The number_of_maker_entries is the 1 6-bit 

50 codeless integer affording the number of entries of the maker private data included in the MakersPrivateData(). There 
must not be present two or more maker private data having the same makerJD values in the MakersPrivateDataQ. 
[0145] The mpd_blocks_size is a 16-bit unsigned integer affording one mpd_block size in terms of 1024 bytes as a 
unit. For example, if mpd_block_size = 1, it indicates that the size of one mpd_block is 1024 bytes. The 
number_of_mpd_blocks is a 16-bit unsigned integer affording the number of mpdjDlocks contained in the MakersPri- 

55 vateDataQ. The makerJD is the 1 6-bit unsigned integer indicating the model number code of the DVR system which 
has created the maker private data. The value encoded to the makerJD is specified by the licensor. 
[0146] The maker_model_code is a 16-bit unsigned integer indicating the model number code of the DVR system 
which has created the maker private data. The value encoded to the maker_model_code is set by the maker who has 
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received the license of the format. The start_mpd_block_n umber is a 1 6-bit unsigned integer indicating the number of 
the mpdjalock number at which begins the maker private data. The leading end of the maker private data must be 
aligned with the leading end of the mpd _block. The start_mpd_block_number corresponds to a variable j in the for- 
loop of the mpd_block. 

5 [0147] The mpdjength is a 32-bit unsigned integer indicating the size of the maker private data. The mpd_block is 
an area in which is stored maker private data. All of the mpd_blocks in the MakersPrivateData() must be of the same 

size. 

[0148] The real PlayList file and the Virtual PlayList file, in other words, xxxxx.rpls and yyyyy.vpls, are explained. 
Fig.23 shows the syntax of xxxxx.rpls (Real PlayList) and yyyyy.vpls (Virtual PlayList), which are of the same syntax 
10 structure. Each of the xxxxx.rpls and yyyyy.vpls is made up of three objects, that is PlayList(), PlayListMark() and 
Makers P ri vateData() . 

[0149] The PlayListMark_Start_address indicates the leading address of the PlayListMark(), in terms of the relative 
number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted from zero. 
[0150] The MakersPrivateData_Start_address indicates the leading address of the MakersPrivateDataQ, in terms 
is of the relative number of bytes from the leading end of the PlayList file as a unit. The relative number of bytes is counted 
from zero. 

[0151] The padding_word (padding word) is inserted in accordance with the syntax of the PlayList file, with N1 and 
N2 being optional positive integers. Each padding word may assume an optional value. 

[0152] PlayList will be further explained in the following although it has been explained briefly. A playback domain 
20 in all Clips except Bridge-Clip must be referred by all PlayLists in the disc. Also, two or more Real PlayLists must not 
overlap the playback domains shown by their Playltems in the same Clip. 

[0153] Reference is made to Figs.24A, 24B and 24C. For ail Clips, there exist corresponding Real PlayLists, as 
shown in Fig.24A. This rule is observed even after the editing operation has come to a close, as shown in Fig.24B. 
Therefore, all Clips must be viewed by referencing one of Real PlayLists. 
25 [0154] Referring to Fig.24C, the playback domain of the Virtual PlayList must be contained in the playback domain 
and in the Bridge-Clip playback domain . There must not be present in the disc Bridge-Clip not referenced by any Virtual 
PlayList. 

[0155] The Real PlayList, containing the list of the Playltem, must not contain SubPlayltem. The Virtual PlayList 
contains the Playltem list and, if the CPI_type contained in the PlayList() is the EP_map type and the PlayList_type is 
30 o (PlayList containing video and audio) , the Virtual PlayList may contain one SubPlayltem. In the PlayList() in the 
present embodiment, the SubPlayltem is used only for audio post recording. The number of the SubPlayltems owned 
by one Virtual PlayList must be 0 or 1 . 

[0156] The PlayList is hereinafter explained. Fig.25 shows the PlayList syntax which is now explained. The 
version_number indicates four character letters indicting the version numbers of the PlayList(). The version_number 
35 js encoded to "0045" in association with ISO 646. Length is a 32-bit unsigned integer indicating the total number of 
byte of the PlayList() as from directly after the length field to the trailing end of the PlayList(). The PlayListJype, one 
example of which is shown in Fig. 26, is an 8-bit field indicating the PlayList type. 

[0157] The CPI_type is one-bit flag indicating the value of the CPI_type of the Clip referenced by the Playltem() and 
by the SubPlayltem(). The CPI__types defined in the CPIs of all Clips referenced by one PlayList must be of the same 

40 values. The number_of_Playltems is a 16-bit field indicating the number of Playltems present in the PlayList. 

[0158] The Playltemjd corresponding to the preset Playltem() is defined by the sequence in which the Playltem() 
appears in the for-loop containing the P!ay!tem(). The Playltemjd begins with 0. The nimber_of_Sub Play Items is a 
16-bit field indicating the number of SubPlayltem in the PlayList. This value is 0 or 1 . An additional audio stream path 
(audio stream path) is a sort of a sub path. 

45 [0159] The UlApplnfoPlayList of the PlayList syntax shown in Fig.25 is explained. The UlApplnfoPlayList stores 
parameters of the user interface application concerning the PlayList. Fig. 27 shows the syntax of the UlApplnfoPlayList, 
which is now explained. The character_set is an 8-bit field indicating the method for encoding character letters encoded 
in the PlayList_name field. The encoding method corresponds to the values conforming to the table shown in Fig. 19. 
[0160] The namejength is an 8-bit field indicating the byte length of the PlayList name indicated in the PlayList_name 

so field. The PlayList_name field shows the appellation of the PlayList. The number of bytes of the number of the 
namejength counted from left of the field is the number of valid characters and indicates the PlayList appellation. The 
values next following these valid character letters may be any values. 

[0161] The record_time_and_date is a 56-bit field storing the date and time on which the PlayList was recorded. This 
field is 14 numerical figures for year/ month/ day/hour/minute/ second encoded in binary coded decimal (BCD). For 
55 example, 2001/12/23:01:02:03 is encoded to "0x20011223010203". 

[0162] The duration is a 24-bit field indicating the total replay time of the PlayList in terms of hour/ minute/ second 
as a unit. This field is six numerical figures encoded in binary coded decimal (BCD). For example, 01 :45:30 is encoded 
to "0x01 4530". 
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[0163] The valid jDeriod is a 32-bit field indicating the valid time periods of the PlayList. This field is 8 numerical 
figures encoded in 4-bit binary coded decimal (BCD). The valid_period is used in the recording and/or reproducing 
apparatus 1 e.g., when the PlayList, for which the valid period has lapsed, is to be automatically erased. For example, 
2001/05/07 is encoded to "0x2001 0507". 

[0164] The makerJD is a 16-bit unsigned integer indicating the maker of the DVR player (recording and/or repro- 
ducing apparatus 1) which has been the last to update its PlayList. The value encoded to maker_ID is assigned to the 
licensor of the DVR format. The maker_code is a 16-bit unsigned integer indicating the model number of the DVR 
player which has been the last to update the PlayList. The value encoded to the maker_code is determined by the 
maker who has received the license of the DVR format. 

[0165] If the flag of the playback_control_flag is set to 1 , its PlayList is reproduced only when the user successfully 
entered the PIN number. If this flag is set to 0, the user may view the PlayList without the necessity of inputting the 
PIN number. 

[0166] If the write_protectJlag is set to 1, the contents of the PlayList are not erased nor changed except the 
write_protect_flag. If this flag is set to 0, the user is free to erase or change the PlayList. If this flag is set to 1 , the 
recording and/or reproducing apparatus 1 demonstrates a message requesting re-confirmation by the user before the 
user proceeds to erase, edit or overwrite the PlayList. 

[0167] The Real PlayList, in which the write_protect_flag is set to 0, may exist, the Virtual PlayList, referencing the 
Clip of the Real PlayList may exist, and the write_protect_flag of the Virtual PlayList may be set to 1 . If the user is 
desirous to erase the Real PlayList, the recording and/or reproducing apparatus 1 issues an alarm to the user as to 
the presence of the aforementioned Virtual PlayList or "minimizes" the Real PlayList before erasing the Real PlayList. 
[0168] If is_played_flag is set to 1 , as shown in Fig.28B, it indicates that the PlayList was reproduced at least once 
since it was recorded, whereas, if it is set to 0, it indicates that the PlayList was not reproduced even once since it was 
recorded. 

[0169] Archive is a two-bit field indicating whether the PlayList is an original or a copy, as shown in Fig.28C. The 
field of ref_thumbnaiUndex indicates the information of a thumbnail picture representative of the PlayList. If the 
ref_thumbnaiUndex f j e id is of a value other than OxFFFF, a thumbnail picture representative of the PlayList is added 
in the PlayList, with the PlayList being stored in the menu.thmb file. The picture is referenced using the value of 
ref_thumbnail_index in the menu.thmb file. If the ref_thumbnail_index field is OxFFFF, no thumbnail picture represent- 
ative of the PlayList is added in the PlayList. 

[0170] The Play Item is hereinafter explained. One PlayltemQ basically contains the following data: 
Clip_lnformation_file_name for specifying the filename of the Clip, IN-time and OUT-time, paired together to specify 
the playback domain of Clip, STC_sequence_id referenced by IN-time and OUT-time in case the CPl_type defined in 
PlayList() is EP_map type, and Connection_Condition indicating the connection condition of previous Playltem and 
current Playltem. 

[0171] If PlayList is made up of two or more Playltems, these Piayltems are arrayed in a row, without temporal gap 
or overlap, on the global time axis of the PlayList. If CPI_type defined in the PlayList is EP_map type and the current 
PlayList does not have the BridgeSequence(), the IN-time and OUT-time pair must indicate the same time on the STC 
continuous domain as that specified by the STC_sequence_id. Such instance is shown in Fig.29. 
[0172] Fig.30 shows such a case in which the CPI_type defined by PlayList() and, if the current Playltem has the 
BridgeSequence(), the rules as now explained are applied. The INJime of the Playltem previous to the current Play- 
item, shown as IN_time1 , indicates the time in Bridge-Clip specified in the BridgeSequencelnfo() of the current Play- 
ltem. This OUTJime must obey the encoding limitations which will be explained subsequently. 
[0173] The IN_time of the current Playltem, shown as IN_time2, indicates the time in Bridge-Clip specified in the 
BridgeSequencelnfoQ of the current Playltem. This IN_time also must obey the encoding limitations as later explained. 
The OUTJime of Playltem of the current Playltem, shown as OUT_time2, indicates the time on the STC continuous 
domain specified by STC_sequenbce_id of the current Playltem. 

[0174] If the CPI_type of PlayList() isTU_map type, the IN-time and OUTJime of Playltem, paired together, indicate 
the time on the same Clio AV stream, as shown in Fig.31 . 

[0175] The Playltem syntax is as shown in Fig. 32. As to the syntax of the Playltem, shown in Fig.32, the field of the 
Clip Jnformation_file_name indicates the filename of the Clip I nformation. The Clip_stream_type defined by the Cliplnfo 
() of this Clip Information file must indicate the Clip AV stream. 

[0176] The STC_sequenceJd is an 8-bit field and indicates the STC_sequenceJd of the continuous STC domain 
referenced by the Playltem. If the CPLtype specified in the PlayListQ is TU_map type, this 8-bit field has no meaning 
and is set to 0. IN_time is a 32-bit field and used to store the playback start time of Playltem. The semantics of IN_time 
differs with CPLtype defined in the PlayList(), as shown in Fig. 33. 

[01 77] OUT_time is a 32-bit field and is used to store the playback end time of Playltem. The semantics of OUT_time 
differs with CPI_type defined in the PlayListQ, as shown in Fig. 34. 

[0178] Connection_condition is a 2-bit field indicating the connection condition between the previous Playltem and 
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the current Playltem, as shown in Fig.35. Figs.36A to 36D illustrate various states of Connectlon_condition shown in 
Fig. 35. 

[0179] BridgeSequencelnfo is explained with reference to Fig.37. This BridgeSequencelnfo is the ancillary informa- 
tion of the current Playltem and includes the following information. That is, BridgeSequencelnfo includes 
5 Bridge_ClipJnformation_file_name for specifying the Bridge_Clip AV file and a Bridge_Clip_lnformation_file_name 
specifying the corresponding Clip Information file (Fig. 45). 

[0180] It is also an address of a source packet on the Clip AV stream referenced by the previous Playltem. Next to 
this source packet is connected the first source packet of the Bridge-Clip AV stream. This address is termed the 
RSPN_exitJrom_previous_Clip. It is also an address of the source packet on the Clip AV stream referenced by the 
10 current Playltem. Ahead of this source packet is connected the last source packet of the Bridge_clip AV stream file. 
This address is termed RSPN_enter_to_current_Clip. 

[0181] In Fig.37, RSPN_arrival_time_discontinuity indicates an address of a source packet in the Bridge„Clip AV 
stream where there is a discontinuous point in the arrival time base. This address is defined in the Cliplnfo() (Fig.46). 
[0182] Fig. 38 shows the syntax of the BridgeSequencelnfo. Turning to the syntax of BridgeSequencelnfo shown in 
is Fig. 38, the field of Bridge_Clip_lnformation_file_name indicates the filename of the Clip Information file corresponding 
to the Bridge_Clip Informationjile. The Clip_stream_type defined in Cliplnfo() of this Clip information file must indi- 
cate 'Bridge_Clip AV stream'. 

[0183] The 32-bit field of the RSPN_exit_from_previous_Clip is a relative address of a source packet on the Clip AV 
stream referenced by the previous Playltem. Next to this source packet is connected the first source packet of the 
20 Bridge_Clip AV stream file. The RSPN_exit_frorn_previous_Clip has a size based on the source packet number as a 
unit, and is counted with the value of the offset_SPN defined in the Cliplnfo() from the first source packet of the Clip 
AV stream file referenced by the previous Playltem. 

[0184] The 32-bit field of RSPN_enter_to_curent__Clip is the relative address of the source packet on the Clip AV 
stream referenced by the current Playltem. Ahead of this source packet is connected the last source packet of the 

25 Bridge_Clip_AV stream file. The RSPN_enter_to_curent_Clip has a size that is based on the source packet number 
as a unit. The RSPN_enter_to_curent_Clip is counted with the value of the offset_SPN, defined in the CliplnfoQ from 
the first source packet of the Clip AV stream file referenced by the current Playltem, as an initial value. 
[0185] The SubPlayltem is explained with reference to Fig.39. The use of SubPlayltemQ is permitted only if the 
CPI_type of the Playl_ist() is the EP_map type. In the present embodiment, SubPlayltem is used only for audio post 

30 recording. The SubPlayltemQ includes the following data. First, it includes Clip_lnformation_file_name for specifying 
the Clip referenced by the sub path in the PlayList. 

[0186] It also includes SubPath_IN_time and SubPath_OUT_time for specifying the sub path playback domain in 
the Clip. Additionally, it includes sync__Playltem_id and start_PTS_of_P lay Item for specifying the time of starting the 
sub path reproduction on the main path time axis. The Clip AV stream, referenced by the sub path, must not contain 
35 STC discontinuous points (discontinuous points of the system time base) . The clocks of audio samples of the Clip used 
in the sub path are locked to the clocks of the audio samples of the main path. 

[0187] Fig.40 shows the syntax of the SubPlayltem. Turning to the syntax of the SubPlayltem, shown in Fig.40, the 
field of the Clip_lnformation_file_name indicates the filename of the Clip Information file and is used by a sub path in 
the PlayList. The Clip_stream_type defined in this CliplnfoQ must indicate the Clip AV stream. 
40 [0188] An 8-bit field of sync_Playltem_id indicates the sub path type. Here, only '0x00' is set, as shown in Fig. 41 , 
while other values are reserved for future use. 

[0189] The 8-bit field of sync_Playltem_id indicates the Playltemjd of the Playltem containing the time of playback 

start of the sub path on the time axis of the main path. The value of Playltemjd corresponding to the preset Playltem 

is defined in the PlayListQ (Fig.25). 
45 [0190] A 32-bit field of sync_start_PTS_of_Playltem denotes the time of playback start of the sub path on the time 

axis of the main path, and denotes the upper 32 bits of the PTS (presentation time stamp) on the Playltem referenced 

by the sync_Playltem _id. The upper 32 bit field of the SubPath_IN_time stores the playback start time of the sub path . 

SubPath_IN_time denotes upper 32 bits of the PTS of 33 bits corresponding to the first presentation unit in the sub path. 

[0191] The upper 32 bitfield of sub Path_OUT_time stores the playback end time of the sub path. SubPath_OUT_time 
50 indicates upper 32 bits of the value of the Presentation_end_TS calculated by the following equation: 

Presentation_end_TS = PTS_OUT + AU_duration 

55 where PTS_out is the PTS of the 33 bit length corresponding to the last presentation unit of the SubPath and 
AU_duration is the 90 kHz based display period of the last presentation unit of the SubPath. 

[0192] Next, PlayListMarkQ in the syntax of xxxxx.rpls and yyyyy.vpls shown in Fig.23 is explained. The mark infor- 
mation pertinent to the PlayList is stored in this PlayListMark. Fig.42 shows the syntax of PlayListMark. Turning to the 
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syntax of the PlayListMark shown in Fig. 42, version_number is four character letters indicating the version number of 
this PlayListMark(). The version_number must be encoded to "0045" in accordance with ISO 646. 
[0193] Length is an unsigned 32 -bit integer indicating the number of bytes of Playl_istMark() from directly after the 
length field to the trailing end of the PlayListMarkQ. The number_of_PlayListMarks is a 1 6-bit unsigned integer indicating 
5 the number of marks stored in the PlayListMark. The number__of_PlayListMarks may be zero. The markjype is an 
8-bit field indicating the mark type and is encoded in the table shown in Fig.43. 

[0194] A 32-bit filed of mark_time_stamp stores a time stamp indicating the point specified by the mark. The seman- 
tics of the markjime_stamp differ with CPI_type defined in the PlayListQ, as shown in Fig. 44. The Playltem_id is an 
8-bit field specifying the Playltem where the mark is put. The values of Playltemjd corresponding to a preset Playltem 

10 is defined in the PlayList() (see Fig. 25). 

[0195] An 8-bit field of character_set shows the encoding method of character letters encoded in the mark_name 
field. The encoding method corresponds to values shown in Fig. 19. The 8-bit field of namejength indicates the byte 
length of the mark name shown in the mark_name field. The mark_name field denotes the mark name indicated in the 
mark_name field. The number of bytes corresponding to the number of namejengths from left of this field is the 

15 effective character letters and denotes the mark name. I n the mark_name field, the value next following these effective 
character letters may be arbitrary. 

[0196] The field of the ref_thumbnail_index denotes the information of the thumbnail picture added to the mark. If 
the field of the ref_thumbnail_index is not OxFFFF, a thumbnail picture is added to its mark, with the thumbnail picture 
being stored in the mark.thmb file. This picture is referenced in the mark.thmb file, using the value of 
20 ref_thumbnail_index, as explained subsequently. If the refJhumbnaiUndex field is OxFFFF, it indicates that no thumb- 
nail picture is added to the mark. 

[0197] The Clip Information file is now explained. The zzzzz.clpi (Clip Information file) is made up of six objects, as 
shown in Fig.45. These are Cliplnfo(), STC_lnfo(), Program(), CPI(), ClipMarkQ and MakersPrivateDataQ. For the AV 
stream (Clip AV stream or Bridge-Clip AV stream) and the corresponding Clip Information file, the same string of nu- 
25 m era Is "zzzzz" is used. 

[0198] Turning to the syntax of zzzzz.clpi (Clip Information file) shown in Fig.45 is explained. The 
Cliplnfo_Start_address indicates the leading end address of Cliplnfo() with the relative number of bytes from the leading 
end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 

[01 99] The STC_lnfo_Start_address indicates the leading end address of STCJnfo with the relative number of bytes 
30 from the leading end byte of the zzzzz.clpi file as a unit. The Program I nfo_Start_address indicates the leading end 
address of Programlnfo() with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. 
The relative number of bytes is counted from 0. The CPI_Start_address indicates the leading end address of CPI() 
with the relative number of bytes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes 
is counted from zero. 

35 [0200] The ClipMark_Start_address indicates the leading end address of ClipMark() with the relative number of bytes 
from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from zero. 
TheJvlakersPrivateData Start_address indicates the leading end address of Makers PrivateData() with the relative 
number of bytes from the leading end byte of the zzzzz.clpi file as a unit. The relative number of bytes is counted from 
zero. The padding_word is inserted in accordance with the syntax of the zzzzz.clpi file. N1 , N2, N3, N4 and N5 must 

40 be zero or optional positive integers. The respective padding words may also assume optional values. 

[0201] The Cliplnfo is now explained. Fig. 46 shows the syntax of Cliplnfo. Fig. 46 shows the syntax of Cliplnfo. In 
the Cliplnfo() is stored the attribute information of corresponding AV stream files (Clip AV stream or Bridge-Clip AV 
stream file). 

[0202] Turning to the syntax of the Cliplnfo shown in Fig. 46, version_number is the four character letters indicating 
45 the version number of this Cliplnfo(). The version_number must be encoded to "0045" in accordance with the ISO 646. 
Length is a 32-bit unsigned integer indicating the number of bytes of Cliplnfo() from directly at back of the length field 
to the trailing end of the Cliplnfo(). An 8-bit field of Clip_stream_type indicates the type of the AV stream corresponding 
to the Clip Information file, as shown in Fig. 47. The stream types of the respective AV streams will be explained sub- 
sequently. 

so [0203] The 32-bit field of offset_SPN gives an offset value of the source packet number of the first source packet 
number of the first source packet of the AV stream (Clip AV stream or the Bridge-Clip AV stream). When the AV stream 
file is first recorded on the disc, this offset_SPN must be zero. 

[0204] Referring to Fig. 48, when the beginning portion of the AV stream file is erased by editing, offset_SPN may 
assume a value other than 0. In the present embodiment, the relative source packet number (relative address) refer- 
55 encing the offset_SPN is frequently described in the form of RSPNxxx, where xxx is modified such that RSPN_xxx is 
RAPN_EP_start. The relative source packet number is sized with the source packet number as a unit and is counted 
from the first source packet number of the AV stream file with the value of the offset_SPN as the initial value. 
[0205] The number of source packets from the first source packet of the AV stream file to the source packet referenced 
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by the relative source packet number (SPN_xxx) is calculated by the following equation: 

SPN_xxx = RSPN_xxx - offset_SPN. 
Fig.48 shows an instance in which offset_SPN is 4. 

[0206] TS_recording_rate is a 24-bit unsigned integer, which affords an input/output bit rate required for the AV 
stream to the DVR drive (write unit 22) or from the DVR drive (readout unit 28). The record_time_and__date is a 56-bit 
field for storing the date and time of recording of the AV stream corresponding to the Clip and is an encoded represen- 
tation of year/month/day/hour/minute in 4-bit binary coded decimal (BCD) for 14 numerical figures. For example, 
2001/2/23:01 .02:03 is encoded to "0x20011223010203". 

[0207] The duration is a 24-bit field indicating the total playback time of the Clip by hour/minute/second based on 
arrival time clocks. This field is six numerical figures encoded in 4-bit binary coded decimal (BCD). For example, 01 : 
45:30 is encoded to "0x014530". 

[0208] A flag time_controlled_flag indicates the recording mode of an AV stream file. If this time_controlled_flag is 
1 , it indicates that the recording mode is such a mode in which the file size is proportionate to the time elapsed since 
recording, such that the condition shown by the following equation: 

Ts_average_rate*192/188*(t - start_time)- a <= size_clip(t) 
<= TS_average_rate*1 92/188*(t - start_time) + a 

where TS_average_rate is an average bit rate of the transport stream of the AV stream file expressed by bytes/second. 
[0209] In the above equation, t denotes the time in seconds, while start_time is the time point when the first source 
packet of the AV stream file was recorded. The size_clip(t) is 10*192 bytes and a is a constant dependent on 
TS_average_rate. 

[0210] If time_controlled Jlag is set to 0, it indicates that the recording mode is not controlling so that the time lapse 
of recording is proportionate to the file size of the AV stream. For example, the input transport stream is recorded in a 
transparent fashion. 

[0211] If time_controlIed_flag is set to 1 , the 24-bit field of TS_average_rate indicates the value of TS_average_rate 
used in the above equation. If time_controlled_flag is set to 0, this field has no meaning and must be set to 0. For 
example, the variable bit rate transport stream is encoded by the following sequence: First, the transport rate is set to 
the value of TS_recording_rate. The video stream is encoded with a variable bit rate. The transport packet is intermit- 
tently encoded by not employing null packets. 

[0212] The 32-bit field of RSPN_arrival_time_discontinuity is a relative address of a site where arrival timebase 
discontinuities are produced on the Bridge-Clip AV stream file. The RSPN_arrivaLtime_discontinuity is sized with the 
source packet number as a unit and is counted with the value of offset_SPN defined in the Cliplnfo() as from the first 
source packet of the Bridge-Clip AV stream file. An absolute address in the Bridge-Clip AV stream file is calculated 
based on the aforementioned equation: 

SPN_xxx = RSPN_xxx - offset_SPN. 

[0213] The 1 44-bit field of reserve r_for_system_use is reserved for a system. If is_format_identifier_valid flag is 1 , 
it indicates that the field of format_identifier is effective. If is_format_identifier_valid flag is 1 , it indicates that the 
format_identifier field is valid. If is_original_networkJD_valid flag is 1, it indicates that the field of 
is_transport_stream_ID -valid is valid. If the flag is_transport_stream_ID-valid is 1, it indicates that the 
transport_streamJD field is valid. If is_servece_ID_valid flag is 1 , it indicates that the serveceJD field is valid. 
[0214] If is_country_code_valid flag is 1 , it indicates that the field country_code is valid. The 32-bit field of 
f ormat_identifier indicates the value of f ormat_identifier owned by a registration descriptor (defined in ISO/I EC1 381 8-1 ) 
in the transport stream. The 16-bit field of original_networkJD indicates the value of the original_network_ID defined 
in the transport stream. 

[0215] The 16-bit field in serveceJD denotes the value of serveceJD defined in the transport stream. The 24-bit 
field of country_code shows a country code defined by IS03166. Each character code is encoded by IS08859-1 . For 
example, Japan is represented as "JPN" and is encoded to "0x4A 0x50 0x4E". The stream_format_name is 1 5 character 
codes of ISO-646 showing the name of a format organization affording stream definitions of transport streams. An 
invalid byte in this field has a value of 'OxFP. 
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[0216] Formatjdentifier, original_network_ID, transport_streamJD, serveceJD, country_code and 
stream Jormat_name indicate service providers of transport streams. This allows to recognize encoding limitations on 
audio or video streams and stream definitions of private data streams other than audio video streams or SI (service 
information). These information can be used to check if the decoder is able to decode the stream. If such decoding is 
possible, the information may be used to initialize the decoder system before starting the decoding. 
[0217] STCJnfo is now explained. The time domain in the MPEG-2 transport stream not containing STC discontin- 
uous points(discontinuous points of the system time base) is termed the STC_sequence. In the Clip, STC_sequence 
is specified by the value of STC_sequence_id. Figs.50A and SOB illustrate a continuous STC domain. The same STC 
values never appear in the same STC_sequence, although the maximum tine length of Clip is limited, as explained 
subsequently. Therefore, the same PTS values also never appear in the same STC_sequence. If the AV stream contains 
N STC discontinuous points, where N > 0, the Clip system time base is split into (N+1) STC_sequences. 
[0218] STCJnfo stores the address of the site where STC discontinuities (system timebase discontinuities) are pro- 
duced. As explained with reference to Fig.51 , the RSPN_STC_start indicates the address and begins at a time point 
of arrival of the source packet referenced by the (k+1)st RSPN_STC_start and ends at a time point of arrival of the 
last source packet. 

[021 9] Fig.52 shows the syntax of the STCJnfo. Turning to the syntax of STCJnfo shown in Fig.52, version_number 
is four character letters indicating the version number of STC_lnfo(). The version_number must be encoded to "0045" 
in accordance with ISO 646. 

[0220] Length is a 32-bit unsigned integer indicating the number of bytes of STCJnfoQ from directly after this length 
field to the end of STCJnfo. If CPIJype of CPI() indicates TU_map type, 0 may be set in this length field. If CPIJype 
of CPI() indicates EP_map type, the num_of_STC_sequence mut be of a value not less than 1 . 
[0221] An 8-bit unsigned integer of num_of_STC_sequence indicates the number of sequences in the Clip. This 
value indicates the number of the for-loops next following the field. The STC_sequence_id corresponding to the preset 
STC_sequence is defined by the order in which appears the RSPN_STC_start corresponding to the STC_sequence 
in the for-loop containing the RSPN_STC_start. The STC_sequenceJd commences at 0. 

[0222] The 32-bit field of RSPN_STC_start indicates an address at which the STC_sequence commences on the 
AV stream file. RSPN_STC_start denotes an address where system time base discontinuities are produced in the AV 
stream file. The RSPN_STC_start may also be a relative address of the source packet having the first PCR of the new 
system time base in the AV stream. The RSPN_STC_start is of a size based on the source packet number and is 
counted from the first source packet of the AV stream file with the offset_SPN value defined in CliplnfoQ as an initial 
value. In this AV stream file, the absolute address is calculated by the above-mentioned equation, that is 

SPN_xxx = RSPN_xxx - offset_SPN. 

[0223] Programlnfo in the syntax of zzzz.clip shown in Fig. 45 is now explained with reference to Fig.53. The time 
domain having the following features in the Clip is termed program_sequence. These feature are that the value of 
PCR_PID is not changed, the number of audio elementary streams is also not changed, the PID values in the respective 
video streams are not changed, the encoding information which is defined by VideoCodinglnfo thereof is not changed, 
the number of audio elementary streams is also not changed, the PID values of the respective audio streams are not 
changed, and that the encoding information, which is defined by AudioCodinglnfo thereof, is not changed. 
[0224] Program_sequence has only one system time base at the same time point. Program_sequence has a sole 
PMT at the same time point. Programlnfo() stores the address of the site where the program_sequence commences. 
RSPN_program_sequence-start indicates the address. 

[0225] Fig.54 illustrates the syntax of Programlnfo. Turning to the Programlnfo shown in Fig.54, version_number is 
four character letters indicating the version number of Programlnfo(). The version_number must be encoded to "0045" 
in accordance with ISO 646. 

[0226] Length is a 32-bit unsigned integer indicating the number of bytes of ProgramlnfoQ from directly at back of 
this length field to the end of program(info(). If CPIJype of CPI{) indicates the TU_map type, this length field may be 
set to 0. If the CPIJype of CPI() indicates EP_map type, the number_of_programs must be of a value not less than 1 . 
[0227] An 8-bit unsigned integer of number_of_program_sequences denotes the number of program_sequences in 
the Clip. This value indicates the number of for-loops next following this field. If program_sequence in the Clip is not 
changed, 1 must be set in the number of program_sequences. A 32-bit field of RSPN_program_sequence_start is a 
relative address where the program sequence commences on the AV stream. 

[0228] RSPN_program_sequence_start is sized with the source packet number as a unit and is counted with the 
value of offset_SPN defined in the Cliplnfo() as from the first source packet of the AV stream file. An absolute address 
in the AV stream file is calculated by: 
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SPN_xxx = RSPN_xxx - offset_SPN. 

The values of RSPN_program_sequence_start in the for-loop syntax must appear in the rising order. 
[0229] A 1 6-bit field of PCR_PID denotes the PID of the transport packet containing an effective PCR field effective 
for the program_sequence. An 8-bit field of number_of_audios indicates the number of for-loops containing 
audio_stream_PID and AudioCodinglnfo(). A 16-bit field of video_stream_PID indicates the PID of the transport packet 
containing a video stream effective for its program_sequence. VideoCodinglnfo(), next following this field, must explain 
the contents of the video stream referenced by its video_stream_PID. 

[0230] A16-bitfieldof audio_stream_PID indicates the PID of a transport packet containing the audio stream effective 
for its program_sequence. The AudioCodinglnfo(), next following this field, must explain the contents of the video 
stream referenced by its audio_stream_PID. 

[0231] The order in which the values ofvideo_stream_PID in the for-loop of the syntax must be equal to the sequence 
of PID encoding of the video stream in the PMT effective for the program_sequence. Additionally, the order in which 
the values of audio_stream_PID appears in the for-loop of the syntax must be equal to the sequence of PID encoding 
of the audio stream in the PMT effective for the program_sequence. 

[0232] Fig. 55 shows the syntax of VideoCodinglnfo in the syntax of the Programlnfo shown in Fig. 54. Turning to the 
syntax of the VideoCoding Info shown in Fig. 55, an 8-bit field of video_format indicates the video format corresponding 
to video_stream_PID in Programlnfo(), as shown in Fig.56. 

[0233] Referring to Fig. 57, an 8-bit field of frame_rate indicates the video frame rate corresponding to the 
video_stream_PID in Programlnfo(). An 8-bit field of display aspect_ratio indicates a video display aspect ratio corre- 
sponding to video_stream_PID in Programlnfo(). 

[0234] Fig.59 shows the syntax of AudioCodinglnfo in the syntax of Programlnfo shown in Fig.54. Turning to the 
syntax of the AudioCoding Info shown in Fig.59, an 8-bit field of audio_format indicates the audio encoding method 
corresponding to audio_stream_PID in Programlnfo(), as shown in Fig. 60. 

[0235] An 8-bit field of audio_component_type indicates an audio component type corresponding to 
audio_stream_PID in Programlnfo() as shown in Fig. 61 , whilst an 8-bit field of sampling_frequency indicates an audio 
sampling frequency corresponding to audio_stream_PID in Programlnfo() as shown in Fig. 62. 

[0236] The CPI (Characteristics Point Information) in the syntax of zzzzz.clip shown in Fig. 45 is explained. The CPI 
is used for correlating the time information in the AV stream with the address in its file. The CPI is of two types, namely 
EP_map and TIMnap. In Fig. 63, if CPI_type in CPI() is EP_map, its CPI() contains EP_map. In Fig. 64, if CPLtype in 
CPI() is TLLmap, its CPI() contains TU_map. One AV stream has one EP_map or one TU_map. If the AV stream is 
an SESF transport stream, the corresponding Clip must own an EP__map. 

[0237] Fig. 65 show the syntax of CPI. Turning to the syntax of CPI shown in Fig. 65, the version_number is four 
character letters indicating the version number of this CPI(). The version_number must be encoded to "0045" in ac- 
cordance with ISO 646. Length is a 32-bit unsigned integer indicating the number of bytes as from directly after this 
length field to the trailing end of the CPI(). The CPLtype is a 1 -bit flag and indicates the CPI type of Clip, as shown in 
Fig. 66. 

[0238] The EP_map in the CPI syntax shown in Fig. 65 is explained. There are two types of the EP_map, that is 
EP_map for a video stream and an EP_map for an audio stream. The EP„map_type in the EP_map differentiates 
between these EP_map types. If the Clip contains one or more video streams, the EP_map for the video stream must 
be used. If the Clip does not contain a video stream but contains one or more audio streams, the EP_/nap for the audio 
stream must be used. 

[0239] The EP_map for a video stream is explained with reference to Fig. 67. The EP_map for the video stream has 
data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID shows the PID of the transport packet trans- 
mitting a video stream. The PTS_EP_start indicates the PTS of an access unit beginning from the sequence header 
of the video stream. The RSPN_RP_start indicates the address of a source packet including the first byte of the access 
unit referenced by the PTS_EP_start in the AV stream. 

[0240] A sub table, termed EP_map_for_one_stream_PID() is created from one video stream transmitted by the 
transport packet having the same PID to another. If plural video streams exist in the Clip, the EP_map may contain 
p I u ra I E P_m ap_f o r_o n e_stream_P I D () . 

[0241] The EP_map for audio stream has data stream_PID, PTS_EP_start and RSPN_EP_start. The stream_PID 
shows a PID of a transport packet transmitting an audio stream. The PTS_EP_start shows the PTS of an accessing 
unit in the audio stream. The RSPN_EP-start indicates an address of a source packet containing a first byte of the 
access unit referenced by PTS_EP_start of the AV stream. 

[0242] The sub table termed EP_map_for_one_stream_PID() is created from one audio stream transmitted by the 
transport packet having the same PID to another. If there exist plural audio streams in the Clip, EP_map may contain 
plural E P_m ap_f o r_o n e_stream_P I D () . 
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[0243] Turning to the relation between EP_map and STCJnfo, one EP_map_for_one_stream_PID() is created in 
one table irrespective of discontinuous points in the STC. Comparison of the value of the RSPN_EP_start to the value 
of RSPN_STC_start defined in STCJnfoQ reveals the boundary of data of EP_map belonging to respective 
STC_sequences (see Fig. 68). The EP_map must have one EP_map_for_one_stream_PID for a continuous stream 

5 range transmitted by the same PID. In the case shown in Fig. 69, program#1 and program#3 have the same video PID, 
however, the data range is not continuous, so that EP_map_for_one_stream_PlD must be provided for each program. 
[0244] Fig. 70 shows the EP_map syntax. By way of explanation of the EP_jnap syntax shown in Fig. 70, the EP_type 
is a 4-bit field and shows the EP_map entry point type, as shown in Fig. 71 . The EP_type shows the semantics of the 
data field next following this field. If Clip includes one or more video stream, the EP_type must be set to 0 ('video'). 

10 Alternatively, if the Clip contains no video stream but contains one or more audio stream, then EP_type must be set 
to 1 ('audio'). 

[0245] The 16-bit field of number_of_stream_PIDs indicates the number of times of loops of the for-loop having 
number_of_stream_PIDs in the EP__map() as a variable. The 16-bit field of stream_PID(k) indicates the PID of the 
transport packet transmitting the number k elementary stream (video or audio stream) referenced by 
is EP_map_Jor_one_stream_PID (num_EP_entries(k)). If EP_type is 0 ('video'), its elementary stream must be a video 
stream. If EP_type is equal to 1 ('audio'), its elementary stream must be the audio stream. 

[0246] The 16-bit field of num_EP_entries(k) indicates the num_EP_entries(k) referenced by EP_map_entries(k)). 
The EP_map_for_one_stream_PID_Start_address(k): This 32- bit field indicates the relative address position at which 
the EP_map_for_one_stream_PID(num_EP_entries(k)) begins in the EP_map(). This value is indicated by the size as 
20 from the first byte of the EP_map(). 

[0247] Padding_word must be inserted in accordance with the EP_map() syntax. X and Y must be optional positive 
integers. The respective padding words may assume any optional values. 

[0248] Fig. 72 shows the syntax of EP_map_for_one_stream_PID. By way of explanation of the syntax of the 
EP_map_for_one_stream_PID shown in Fig. 72, the semantics of the 32-bit field of PTS_EP„start differs with the 
25 EP_type defined by EP_map(). If EP_type is equal to 0 (Video'), this field has upper 32 bits of the 33-bit precision PTS 
of the access unit beginning with a sequence header of the video stream. If the EP_type is equal to 1 ('audio'), this 
field has upper 32 bits of PTS of 33 bit precision of the access unit of the audio stream. 

[0249] The semantics of the 32-bit field of RSPN_EP_start differs with the EP_type defined in EP_map(). If EP_type 
is equal to 0 ('video'), this field indicates the relative address of the source packet including the first byte of the sequence 
30 header of the access unit referenced by the PTS_EP_start in the AV stream. Alternatively, if EP_type is equal to 1 
('audio'), this field indicates the relative address of the source packet containing the first byte in the audio stream of 
the access unit referenced by the PTS_EP_start in the AV stream. 

[0250] RSPN_EP_start is of a size which is based on the source packet number as a unit, and is counted from the 
first source packet of the AV stream file, with the value of the offset_SPN, defined in Cliplnfo(), as an initial value. The 
35 absolute address in the AV stream file is calculated by 

SPN_xxx = RSPN_xxx - offset_SPN. 

40 [0251] It is noted that the value of the RSPN_EP_start in the syntax must appear in the rising order. 

[0252] The TU_map is now explained with reference to Fig. 73. TU_map forms a time axis based on the source 
packet arrival time clock (timepiece of the arrive time base). This time axis is termed TU_map_time_axis. The point of 
origin of TU_map_time_axis is indicated by offset_time in the TU_map(). TU_map_time_axis is divided in a preset unit 
as from offset_time, this unit being termed time_unit. 

45 [0253] In each time_unit in the AV stream, addresses on the AV stream file of the source packet in the first complete 
form are stored in TU_map. These addresses are termed RSPN_time_unit_start. The time at which begins the k(k > 
0)th time__unit on the TU_map_time_axis is termed TU_start_time(k). This value is calculated based on the following 
equation: 

50 

TU_start_time(k) = offset_time + k*time_unit_size. 
[0254] It is noted that TU_start_time(k) has a precision of 45 kHz. 

[0255] Fig. 74 shows the syntax of TU_map. By way of explanation of the TU_map syntax shown in Fig. 74, the 32-bit 
55 field of offset_time gives an offset time relative to TU_map_time_axis. This value indicates the offset time relative to 
the first time_unit in the Clip. The offset_time is of a size based on 45 kHz clock derived from the 27 MHz precision 
arrival time clocks as unit. If the AV stream is to be recorded as new Clip, offset_time must be set to 0. 
[0256] The 32-bit field of time unit_size affords the size of the time_unit, and is based on 45 kHz clocks, derived from 
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the 27 MHz precision arrival time clocks, as unit. Preferably, time_unit_size is not longer than one second 
(time_unit_size < 45000). The 32 bit field of number_of_time_unit_entries indicates the number of entries stored in 
TU_map(). 

[0257] The 32-bit field of RSN_time_unit_start indicates the relative address of a site in the AV stream at which 
begins each time_unit. RSN_time_unit_start is of a size based on the source packet number as unit and is counted 
with the value of offset_SPN defined in Cliplnfo() as from the first source packet of the AV stream file as an initial value. 
The absolute address in the AV stream file is calculated by 

SPN_xxx - RSPN„xxx - offset_SPN. 

[0258] It is noted that the value of RSN_time_unit_start in the for-loop of the syntax must appear in the rising order. 
If there is no source packet in the number (k+1) time_unit, the number (k+1) RSN_time_unit_start must be equal to 
the number k RSPN_time_unit_start. 

[0259] By way of explanation of the ClipMark in the syntax of zzzzz.clip shown in Fig.45, the ClipMark is the mark 
information pertinent to clip and is stored in the ClipMark. This mark is not set by a user, but is set by a recorder 
(recording and/or reproducing apparatus 1). 

[0260] Fig. 75 shows the ClipMark syntax. By way of explanation of the ClipMark syntax shown in Fig.75, the 
version_number is four character letters indicating the version number of this ClipMark. The version_number must be 
encoded in accordance with ISO 646 to "0045". 

[0261 ] Length is a 32-bit unsigned integer indicating the number of bytes of the CIipMark() as from directly after the 
length field to the trailing end of ClipMarkQ. The number_of_Clip_marks is a 16-bit unsigned integer indicating the 
number of marks stored in ClipMark and may be equal to 0. Mark_type is an 8-bit field indicating the mark type and is 
encoded in accordance with the table shown in Fig. 76. 

[0262] Mark_time_stamp is a 32-bit field and stores the time stamp indicating a pointer having a specified mark. The 
semantics of mark_time_stamp differs with CPI_type in the PlayList0, as shown in Fig. 77. 

[0263] If CPI_type in CPI() indicates the EP_map type, this 8-bit field indicates the STC_sequence_id of the contin- 
uous STC domain where there is placed mark_time_stamp. If CPI_type in CPI() indicates TU_map type, this 8-bit field 
has no meaning but is set to 0. The 8-bit field of Character_set indicates the indicating method of character letters 
encoded in the mark_name field. The encoding method corresponds to the value shown in Fig. 19. 
[0264] The 8-bit field of namejength indicates the byte length of the mark name shown in the mark_name field. This 
mark_name field indicates the mark name. The byte number corresponding to the number of the namejength from 
left of this field is the effective character number and denotes the mark name. In the mark_name field, the values next 
following these effective character letters may be arbitrary. 

[0265] The field of ref_thumbnail_index indicates the information of the thumbnail picture appended to the mark. If 
the ref_thumbnaiLindex field is of a value different from OxFFFF a thumbnail picture is added to its mark, with the 
thumbnail picture being stored in the mark.thumb file. This picture is referenced using the value of ref__thumbnail_index 
in the mark.thumb file. If the ref_thumbnail_index field is of a value equal to OxFFFF a thumbnail picture is not appended 
to its mark. 

[0266] Fig. 78 shows another syntax of ClipMark which takes the place of Fig.75. Fig. 79 shows a typical table of 
mark_type which takes the place of Fig. 76 in such case. Reserved_forjTiakerJD is a 16-bit field indicating the maker 
ID of the maker which, if mark__type indicates a value from OxCO to 0XFF, defines the mark_type. The maker ID is 
specified by a DVR format licensor. Mark_entry() is the information indicating the point specified to a mark-point. The 
syntax will be explained subsequently in detail. Representative_picture_entry is the information indicating the point of 
the information representative to the mark shown by mark_entry(). The syntax of representative_picture_entry will be 
explained subsequently in detail. 

[0267] ClipMark is used in order to enable a user reproducing an AV stream to retrieve the contents visually. A DVR 
player uses a GUI (graphical user interface) to present the ClipMark information to the user. For visual display of the 
ClipMark information, a picture indicated by representative_picture_entry(), rather than the picture indicated by 
mark_entry(), is to be indicated. 

[0268] Fig. 80 shows examples of mark_entry() and representative_picture_entry. It is assumed that a program name 
(title) of a certain program is displayed after some time, such as several seconds, as from the start of such program. 
If ClipMark is to be created, mark_entry() may be put at a start point of the program, with the 
representative_picture_entry() then being put at a point of display of the program name (title) of the program. 
[0269] If the picture of representative_picture_entry is displayed on the GUI and the user has specified the picture, 
the DVR player starts replay as from the point where mark_entry is put. 
[0270] Fig. 81 shows the syntax of mark_entry() and representative_picture_entry(). 

[0271] Mark_time_stamp is a 32-bit field and stores a time stamp indicating a point specified by the mark and a time 
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stamp indicating a point of a picture representing a mark specified by mark_entry(). 

[0272] Fig. 82 shows a typical syntax of mark_entry() and representative_picture_entry() in case of employing the 

address based information rather than employing the information of the time stamp base by the PTS. 

[0273] In case of mark_entry(), RSPN_ref_EP_start indicates the relative address of a source packet indicating the 

5 entry point of a stream for decoding a picture of a mark point in the AV stream. In case of representatives icture_entry 
(), RSPN_ref_EP_start indicates the relative address of a source packet indicating the entry point of a stream for 
decoding a picture representative of a mark indicated by mark_entry(). The value of RSPN_ref_EP_start must be stored 
in the EP_map as RSPN_ref_EP_start. Moreover, the value of PTS_EP_start corresponding to the RSPN_EP_start 
must be closest in the EP_map in the past as from the PTS of the mark point picture. 

10 [0274] Offset_nm_pictures is a 32-bit field and indicates the number of pictures of offset from the picture referenced 
by RSPN_ref_EP_start to the picture indicated by the mark point in the display sequence. This number is counted from 
0. In the case of the embodiment of Fig. 83, offset_nm_pictures is 6. 

[0275] Fig. 84 shows another typical syntax of mark_entry() and representative_picture_entry() in case of employing 
the address based information for specifying ClipMark. 
is [0276] In case of mark_entry(), RSPN_mark_point indicates the relative address of the source packet including the 
first byte of the access unit referenced by the mark in the AV stream. In case of representative_picture_entry(), 
RSPN_mark_point indicates the relative address of the source packet including the first byte of the encoded picture 
representative of the mark indicated by the mark_entry(). 

[0277] RSPN_mark_point is of a size having the source packet number as a unit and is counted using the value of 
20 offset„SPN defined in the Clip Information file from the first source packet of the AV stream file. 

[0278] Referring to Fig. 85, the relation between ClipMark and EP_map is explained. In the present embodiment, it 
is assumed that EP_map specifies 10, 11 and In as addresses of the entry point, and that the an l-picture continuing to 
the sequence header from these addresses is started. If ClipMark specifies M1 as an address of a certain mark, and 
if the picture beginning from this source packet is to be decodable, it suffices if data readout is initiated from 11 as an 
25 entry point ahead of and closest to the M1 address. 

[0279] MakerPrivateData has already been explained with reference to Fig.22 and hence is not explained here spe- 
cifically. 

[0280] Next, thumbnaiUnformation is explained. A thumbnail picture is stored in a menu.thmb file or in a mark.thmb 
file. These files are of the same syntax structure and own a sole Thumbnail (). The menu.thmb file stores a picture 

30 representing respective PlatyLists. The totality of menu thumbnails are stored in the sole menu.thmb file. 

[0281] The mark.thmb file stores a mark thumbnail picture, that is a picture representing a mark point. The totality 
of mark thumbnails corresponding to the totality of PlayLists and Clips are stored in the sole mark.thmb file. Since the 
thumbnails are frequently added or deleted, the operation of addition and partial deletion must be executable readily 
and speedily. For this reason, Thmbnail() has a block structure. Picture data is divided into plural portions each of which 

35 js stored in one tn_block. One picture data is stored in consecutive tn_blocks. In the string of tn_blocks, there may 
exist a tn_block not in use. The byte length of a sole thumbnail picture is variable. 

[0282] Fig. 86 shows the syntax of the menu.thmb file and the mark.thmb file, whilst Fig. 87 shows the syntax of the 
thumbnail syntax in the syntax of the menu.thmb file and the mark.thmb file. By way of explanation of the Thmbnail 
syntax shown in Fig. 87, the version_number is the four character letters indicating the version number of this Thumbnail 

40 (). The version_n umber must be encoded to "0045" in accordance with ISO 646. 

[0283] Length is a 32-bit unsigned integer indicating the number of bytes of makers_private_data as from directly 
after the length field to the trailing end of Thumbnail(). Tn_block_start_address is a 32-bit unsigned integer indicating 
the leading byte address of the first tn_block, with the number of relative bytes as from the leading byte of Thumbnail 
() as unit. The number of relative bytes is counted from 0. Number_of_thumbnails is a 1 6-bit unsigned integer affording 

45 the number of entries of the thumbnail picture contained in the Thumbnail(). 

[0284] Tn_block_size is a 16-bit unsigned integer affording one tn_block size with 1024 bytes as unit. For example, 
if tn_block_size = 1 , it indicates that the size of one tn_block is 1 024 bytes. N urn be r_of_tn_b locks is a 1 6-bit unsigned 
integer representing the number of entries of tn_blocks in theThumbnail() . Thumbnail_index is a 1 6-bit unsigned integer 
representing the index number of a thumbnail picture represented by the thumbnail information corresponding to one 

so for-loop beginning from this thumbnail_index field. The value of OxFFF must not be used as thumbnaiUndex. The 
thumbnail_index is referenced by ref_thumbnaiLindex in the UIApptnfVolume(), UIApplnfoPlayList(), PlayListMark() 
and ClipMarkQ. 

[0285] Thumb nail_picture_format is an 8-bit unsigned integer representing the picture format of a thumbnail picture 
and assumes a value shown in Fig. 88. In the table, DCF and PNG are allowed only in "menujhumb". The mark 
55 thumbnail must assume a value "0x00" (MPEG-2 Video l-picture). 

[0286] Picture_data_size is a 32-bit unsigned integer indicating the byte length of a thumbnail picture in terms of a 
byte as a unit. Start_tn_block_n umber is a 1 6-bit unsigned integer indicating the tn_block number of the tn_block where 
begins the thumbnail picture data. The leading end of the thumbnail picture data must be coincident with the leading 
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end of tn _block. The tn_block number begins with 0 and is relevant to the value of a variable k in the f or-loop in tn jDlock. 
[0287] X_picture_length is a 16-bit unsigned integer representing the number of pixels in the horizontal direction of 
a frame picture of the thumbnail picture. Y_picture_length is a 16-bit unsigned integer representing the number of 
pixels in the vertical direction of a frame picture of the thumbnail picture. Tn_block is an area in which a thumbnail 
5 picture is stored. The totality of tn_blocks in Thumbnail() are of the same size (fixed length) and are defined in size by 
tn_block_size. 

[0288] Figs.89A and 89B schematically show how thumbnail picture data are stored in tn_block. As shown in Figs. 
89A and 89B, the respective thumbnail picture data begin from the leading end of tn_block and, if the picture data are 
of a size exceeding tn_block, the next consecutive tn_block is used for storage. By so doing, picture data with a variable 
10 size may be managed as fixed length data such that editing such as deletion can be coped with by a simpler processing 
operation. 

[0289] An AV stream file is now explained. The AV stream file is stored in "M2TS" directory (Fig. 14). There are two 
types of the AV stream files, one being a CLIP AV stream and the other being a Bridge-Clip AV stream. Both streams 
must be of a DVR MPEG-2 transport stream file structure as hereinafter defined. 

15 [0290] First, the DVR MPEG-2 transport stream is explained. The structure of DVR MPEG-2 transport stream is as 
shown in Fig.90. The AV stream file has the structure of a DVR MPEG-2 transport stream constructed by an integer 
number of Aligned units. The Aligned unit has a size of 6144 bytes (= 2048x3 bytes). The Aligned unit begins at the 
first byte of the source packet having a length of 192 bytes. One source packet is made up of TP_extra_header and 
a transport packet. The TP_extra_header is 4 bytes long, with the transport packet being 188 bytes long. 

20 [0291] One Aligned unit is made up of 32 source packets. The last Aligned unit in the DVR MPEG-2 transport stream 
is also made up of 32 source packets. So, the DVR MPEG-2 transport stream ends at a boundary of the Aligned unit. 
If the number of the transport packets of the input transport stream recorded on the disc is not a multiple of 32, the 
source packet having a null packet (transport packet of PID = 0x1 FFF) must be used as the last Aligned unit. The file 
system is not allowed to add redundant information to the DVR MPEG-2 transport stream. 

25 [0292] Fig.91 shows a recorder model of the DVR MPEG-2 transport stream. The recorder shown in Fig. 91 is a 
conceptual model for prescribing a recording model. The DVR MPEG-2 transport stream follows this model. 
[0293] The input timing of the MPEG-2 transport stream is explained. The input MPEG-2 transport stream is a full 
transport stream or a partial" transport stream. The input MPEG-2 transport stream must obey the ISO/IEC13818-1 or 
ISO/I EC1 381 8-9. The number i byte of the MPEG-2 transport stream is input at time t(i) simultaneously to both aT-STD 

30 Transport stream system target decoder as provided in ISO/IEC 13818-1) 51 and a source packetizer 54. Rpk is an 
instantaneous maximum value of the input rate of the transport packet. 

[0294] A 27 MHz PLL52 generates a frequency of 27 MHz clocks. The frequency of 27 MHz clocks is locked at a 
value of PCR (program clock reference) of the MPEG-2 transport stream. An arrival time lock counter 53 is a binary 
counter counting pulses of the frequency of 27 MHz. The arrival_time_clock(i) is a count value of the arrival time clock 
35 counter 53 at time t(i). 

[0295] The source packetizer 54 appends TP_extra_header to the totality of transport packets to create a source 
packet. Arrival_time_stamp indicates the time at which the first byte of the transport packet arrives at both the T-STD 
51 and the source packetizer 54. Arrival_time_stamp(k) is a sample value of the arrival_time_clock(k), as indicating 
by the following equation: 

40 

arrivaLtime_clock(k) _ arrival_time_clock(k)% 230 

where k indicates the first byte of the transport packet. 
45 [0296] If the time interval of two consecutively input transport packets amounts to not less than 230/27000000 sec 
(about 40 sec), the difference between the arrival_time_clock(k) of the two transport packets should be set to 
230/27000000 sec. The recorder is provided for such case. 

[0297] A smoothing buffer 55 smooths the bitrate of the input transport stream. Th smoothing buffer 55 is not allowed 
to overflow. Rmax is an output bitrate of a source packet from the smoothing buffer 55 when the smoothing buffer 55 
so is not void. If the smoothing buffer 55 is void, an output bitrate of the smoothing buffer 55 is 0. 

[0298] The parameters of a recorder model of the DVR MPEG-2 transport stream are hereinafter explained. The 
value of Rmax is given by TS_recording_rate defined in Cliplnfo() associated with the AV stream file. This value may 
be calculated in accordance with the following equation: 

55 

Rmax = TS_recording_rate*1 92/1 88 
where the value of TS_recording_rate is of a size based in bytes/second as unit. 



25 



EP 1 280 348 A1 



[0299] If the input transport stream is an SESF transport stream, Rpk must be eq uaf to TS_recording_rate as defined 
in Cliplnfo() associated with the AV stream file. If the input transport stream is not an SESF transport stream, the values 
defined in descriptors of the MPEG-2 transport stream, such as smoothing_buffer_descriptor, 
short_smoothing_bugger_descriptor or partiaLtransport_stream_descriptor, may be referenced as to the size of the 
5 smoothing buffer 55. 

[0300] If the input transport stream is an SESF transport stream, the size of the smoothing buffer 55 (smoothing 
buffer size) is zero. If the input transport stream is not an SESF transport stream, the values defined in descriptors of 
the MPEG-2 transport stream, such as smooth ing_buffer_descriptor, short_smoothing_bugger_descriptor or 
partial_transport_stream_descriptor I may be referenced as to the size of the smoothing buffer 55. 
10 [0301] For a recorder and a reproducing unit, a buffer of a sufficient size needs to be used. The default buffer size 
is 1536 bytes. 

[0302] A player model of the DVR MPEG-2 transport stream is now explained. Fig. 92 shows a player model of the 
DVR MPEG-2 transport stream. This is a conceptual model for prescribing the replay process. The DVR MPEG-2 
transport stream obeys this model. 
15 [0303] A 27MHz X-tal (crystal oscillator) 61 generates a 27MHz frequency. The error range of the 27 MHz frequency 
must be ± 30 ppm (27000000+81 0 Hz). An arrival time clock counter 62 is a binary counter counting pulses of 27 MHz. 
ArrivaLtime_clock(i) is a count value of the arrival time clock counter 62 at time t(i). 

[0304] In the smoothing buffer 64, Rmax is an input bitrate of a source packet to the smoothing buffer 64 when the 
smoothing buffer 64 is not full. If the smoothing buffer 64 is full, the input bitrate to the smoothing buffer 64 is 0. 
20 [0305] By way of explaining the output timing of the MPEG-2 transport stream, if arrival_time_stamp of the current 
source packet is equal to the value of the 30 LSB bits of arnval_time_clock(i), the transport packet of the source packet 
is extracted from the smoothing buffer 64. Ppk is an instantaneous maximum value of the transport packet rate. The 
smoothing buffer 64 is not allowed to overflow. 

[0306] The parameters of the player model of the DVR MPEG-2 transport stream are the same as the parameters 

25 of the recorder model of the above-described DVR MPEG-2 transport stream. 

[0307] Fig. 93 shows the syntax of the Source packets, while transport_packet() is an MPEG-2 transport packet as 
prescribed in ISO/IEC 13818-1 . Fig. 94 shows a TP_extra_header in the syntax of the source packet shown in Fig. 93. 
By way of explaining the syntax of the TP_extra_header shown in Fig. 94, copy_permissionjndicator is an integer 
representing copy limitations of the payload of the transport packet. The copy limitations may be set to copy free, no 

30 more copy, copy once or copy prohibited. Fig. 95 shows the relation between the values of copy_perrnission_indicator 
and the modes these values represent. 

[0308] Copy jermission_indicator is appended to the totality of transport packets. If an input transport stream is to 
be recorded using the IEEE1394 digital interface, the value of copy_permission_indicator may be correlated with the 
value of EMI (encryption mode indicator) in the IEEE1394 isochronous packet header. If the input transport stream is 
35 recorded without using the IEEE 1394 digital interface, the value of copy_permission_indicator may be correlated with 
the value of the CCI embedded in the transport packet. In self-encoding an analog signal input, the value of the 
copy_permission_jndicator may be correlated with the value of CGMS-A of the analog signal. 

[0309] It is noted that arrivaLtime_stamp is an integer value specified by arrivaLtime_stamp in the following equation: 

40 

arrival_time_stamp(k) = arrival_time_clock(k)%230. 

[0310] In defining a Clip AV stream, the Clip AV stream must have the structure of the DVR MPEG-2 transport stream 
defined as described above. ArrivaL_time_clock(i) must increase continuously in a Clip AV stream. If a discontinuous 
45 point of the system time base (STC base) should exist in the Clip AV stream, the arriva!_time_clock(i) of the Clip AV 
stream must increase continuously. 

[0311] The maximum value of the difference of the arrivaLtime_cIock(i) between the beginning and the end in the 
Clip AV stream must be 26 hours. This limitation guarantees that, if there is no discontinuous point of the system time 
base (STC base) in the MPEG-2 transport stream, no PTS (presentation time stamp) of the same value never appears 
so in the Clip AV stream. The MPEG2 system standards provide that the wraparound time of PTS be 233/90000 sec 
(approximately 26.5 hours). 

[0312] In defining the Bridge-Clip AV stream, the Bridge-Clip AV stream must have a structure of a DVR MPEG-2 
transport stream defined as described above. The Bridge-Clip AV stream must include one arrival time base discon- 
tinuous point. The transport stream ahead and at back of the arrival time base discontinuous point must obey the 
55 limitations on encoding as later explained and also must obey the DVR-STD as later explained. 

[0313] The present embodiment provides for seamless support of the seamless connection of the video and audio 
between Playltems in editing. This seamless connection between the Playltems guarantees the "continuous data sup- 
ply" to the player/recorder and "seamless decoding". The "continuous data supply" means the capability of a file system 
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in assuring data supply to a decoder at a bitrate necessary for prohibiting buffer underflow. The "continuous data supply" 
assures data storage continuously in terms of a block of a sufficient size to assure real-time properties and data readout 
from the disc as a unit. 

[0314] The "seamless decoding" means the capability of the player in demonstrating audio/video data recorded on 

5 the disc without producing pause or gap in the decoder replay output. 

[0315] The AV stream referenced by the seamlessly connected Play Item is explained. Whether or not the connection 
between the previous Playltem and the current Playltem is guaranteed to enable seamless display can be verified from 
the connection_condition field defined in the current Playltem. There are a method for seamless connection between 
Playltems employing Bridge-Clip and a method for seamless connection between Playltems not employing Bridge-Clip. 

w [0316] Fig. 96 shows the relation between the previous Playltem and the current Playltem. Fig. 96 shows TSI made 
up of a shaded stream data of Clipl (Clip AV stream) and shaded stream data previous to 
RSPN_arrival_time_discontinuity of Bridge-Clip. 

[0317] The shaded stream data of Clipl of TSI is stream data from an address of a stream necessary for decoding 
a presentation unit associated with IN_time of the previous Playltem (shown by IN_time in Fig. 96) to a source packet 
15 referenced by RSPN_exit_from__previous„Clip. The shaded stream data previous to RSPN_arrivaLtime_discontinuity 
contained in TSI is stream data from the first source packet of Bridge-clip to a source packet referenced by 
RSPN_arrival_time_discontinuity. 

[0318] On the other hand, TS2 in Fig.96 is made up of shaded stream data of Clip2 (Clip AV stream) and shaded 
stream data subsequent to PN_arrivaLtime_discontinuity of Bridge-Clip. The shaded stream data subsequent to 
20 RSPN_arrival_time_discontinuity of Bridge-Clip contained in TS2 is stream data from the source packet referenced 
by RSPN_enterJo_current_Clip to an address of a stream required fir decoding the presentation unit associated with 
OUTJime of the current Playltem (shown in OUT_time2 in Fig.96). 

[0319] Fig. 97 shows the relation between the previous Playltem and the current Playltem in case of not employing 
Bridge-Clip. In this case, the stream data read out from the player is shown shaded. In Fig. 97, TS1 the shaded stream 

25 data of TS1 is data beginning from the address of a stream necessary for decoding the presentation unit associated 
with IN_time of previous Playltem (shown with IN_time1 in Fig.97) and extending up to the last source packet of Clipl . 
On the other hand, TS2 in Fig.97 is comprised of shaded stream data of Clip2 (Clip AV stream). 
[0320] The shaded stream data of Clip2 of TS2 is stream data beginning at a first source packet of Clip2 and ending 
at an address of a stream necessary for decoding the presentation unit associated with the OUTJime of the current 

30 Playltem (shown OUT_time2 in Fig.97). 

[0321] In Figs. 96 and 97, TS1 and T2 are continuous streams of source packets. Next, stream prescriptions of TS1 
and TS2 and connection conditions therebetween are considered. As a limitation of the encoding structure of the 
transport stream, the number of video streams contained in TS1 and TS2 must be 1 . The number of audio streams 
contained in TS1 and TS2 must be 2 or less. The numbers of audio streams contained in TS1 and TS2 must be equal 

35 to each other. It is noted that TS1 and/or TS2 may contain an elementary stream or a private stream other than those 
mentioned above. 

[0322] Limitations on a video bitstream are now explained. Fig. 98 shows an instance of seamless connection shown 
in the picture display sequence. In order that a video stream may be represented seamlessly, unneeded pictures 
represented at back of OUTJime (OUTJime of Clipl) and ahead of the IN_time2 (1N_time of Clip2) must be removed 

40 by a process of re-encoding a partial stream of the Clip near a junction point. 

[0323] Fig. 99 shows an instance of realizing seamless connection using BridgeSequence in the case shown in Fig. 
98. The video stream of Bridge-Clip previous to RSPN_arrival_time_discontinuity is made up of encoded video streams 
up to the picture associated with the OUT_time1 of Clipl of Fig. 98. This video stream is connected to the video stream 
of the previous Clipl and is re-encoded so as to prove one continuous elementary stream conforming to the MPEG2 

45 standard. 

[0324] In similar manner, the video stream of Bridge-Clip subsequent to RSPN_arrival_time_discontinuity is made 
up of an encoded video stream as from a picture corresponding to IN_time2 of Clip2 of Fig. 98. The video stream can 
start decoding correctly, is connected to the video stream of the next following Clip2 and is re-encoded so as to become 
one continuous elementary stream conforming to MPEG2 standard. For creating Bridge_Clip, several pictures in gen- 

50 eral need to be re-encoded, while other pictures can be copied from the original clip. 

[0325] Fig. 100 shows a case of realizing seamless connection without using BridgeSequence in the case of the 
embodiment shown in Fig. 98. The video stream of Clipl is made up of an encoded video stream up to a picture asso- 
ciated with OUTJimel of Fig. 98 , this being re-encoded so as to become one continuous elementary stream conforming 
to MPEG2 standard. Similarly, the video stream of Clip2 is comprised of an encoded video stream subsequent to the 

55 picture associated with the IN-time2 of Clip2 of Fig. 98, this being re-encoded so as to become one continuous ele- 
mentary stream conforming to MPEG2 standard. 

[0326] By way of explaining encoding limitation of the video stream, the frame rate of TS1 and TS2 video streams 
must be equal to each other. The TS1 video stream must end in sequence_end_code, while the TS2 video stream 
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must start in sequence_header, GOP header and in an l-picture. TheTS2 video stream must start in closed GOP. 
[0327] The video presentation unit (frame or field) as defined in a bitstream must be continuous with a junction point 
in-between. At the junction point, the top/bottom field sequence must be continuous. In the case of encoding employing 
the 3-2 pulldown, it may be necessary to rewrite "top_field_first 1 ' and "repeat_first_field", while local re-encoding may 
5 also be made to prevent occurrence of a field gap. 

[0328] By way of explaining encoding limitations on an audio bitstream, the audio sampling frequency of TS1 and 
that of TS2 must be equal to each other. The audio encoding method, such as MPEG1 layer 2, AC-3, SESF, LPCM 
and AAC of TS1 and TS2 must be equal to each other. 

[0329] By way of explaining coding limitations on the MPEG-2 transport stream, the last audio frame of the TS1 audio 
10 stream must contain an audio sample having a display time equal to that at display end time of the display picture of 
TS1 . The first audio frame of TS2 audio stream must contain an audio sample equal to that at display start time of the 
first display picture of TS2. 

[0330] At a junction point, there must not be a gap in the sequence of the audio presentation unit. As shown in Fig. 
101, there may be an overlap defined by the length of the audio presentation unit not larger than two audio frame 
15 domain. The first packet transmitting the TS2 elementary stream must be a video packet. The transport stream at the 
junction point must obey the DVR-STD which will be explained subsequently. 

[0331] By way of explaining the limitations of Clip and Bridge-Clip, TS1 and TS2 must not contain discontinuous 
points of the arrival time base therein. 

[0332] The following limitations are applied only to the case of using Bridge-Clip. The Bridge_Clip AV stream has 
20 only one arrival time base discontinuous point at a junction point between the last source packet of TS1 and the first 
source packet of TS2. The RSPN_arrivaLtime_discontinuity as defined in Clip lnfo() must indicate an address of the 
discontinuous point which in turn must indicate the address referencing the first source packet of TS2. 
[0333] The source packet referenced by the RSPN_exit_from_previous_Clip as defined in BridgeSequencelnfo{) 
may be any source packet in the Clip. It is unnecessary for this source packet to be a boundary of the Aligned unit. 
25 The source packet referenced by the RSPN_exit_from_current_Clip as defined in BridgeSequencelnfo() may be any 
source packet in the Clip. It is unnecessary for this source packet to be a boundary of the Aligned unit. 
[0334] By way of explaining limitations on the Playltem, the OUT_time of previous Playltem (OUTJime 1 shown in 
Figs. 96 and 97) mist indicate the display end time of the last presentation unit of TS1. The IN_time of the current 
Playltem (IN_time2 shown in Figs. 96 and 97) must indicate the display start time of the first video presentation unit of 
30 TS2. 

[0335] By way of explaining the limitations on data allocation in using the Bridge-Clip with reference to Fig. 102, 
seamless connection must be created such as to guarantee continuous data supply by the file system. This must be 
done by arraying the Bridge-Clip AV stream, connected to the Clipl (Clip AV stream file) and Clip2 (Clip AV stream 
file), such as to satisfy the data allocation prescriptions. 

35 [0336] The RSPN_exit_from_previous_Clip must be selected so that the stream portion of the Clipl (Clip AV stream 
file) previous to RSPN_exit_from_previous_Clip (Clip AV stream file) will be arrayed in a continuous area not smaller 
than one-half fragment. The data length of the Bridge-Clip AV stream must be selected so as to be arrayed in a con- 
tinuous area not smaller than one-half fragment. RSPN_enter_to_current_Clip must be selected so that the stream 
portion of Clip2 (Clip AV stream file) subsequent to PN_enter_to_current_Clip is arrayed in a continuous area not 

40 smaller than one-half fragment. 

[0337] By way of explaining the limitations of data allocation in case of seamless connection without using Bridge-Clip, 
seamless connection must be created so that continuous data supply will be guaranteed y the file system. This must 
be carried out by arraying the first portion of last portion of the Clipl (Clip AV stream file) and the first portion of Clip2 
(Clip AV stream file) such as to meet the data allocation prescriptions. 

45 [0338] The first stream portion of Clipl (Clip AV stream file) must be arranged in a continuous area not shorter than 
one half fragment. The first stream portion of Clip2 (AV stream file) must be arranged in a continuous area not smaller 
than one half fragment. 

[0339] Next, the DVR-STD is explained. The DVR-STD is a conceptual model for modeling the decoding processing 
in generating and verifying the DVR MPEG-2 transport stream. On the other hand, the DVR-STD is a conceptual model 
so for modeling the decoding processing in generating and verifying the AV stream referenced by two Play Items connected 
seamlessly as described above. 

[0340] Fig. 104 shows a DVR-STD model. The model shown in Fig. 104 contains, as a constituent element, a DVR 
MPEG-2 transport stream player model. The notation such as n, Tbn, Mbn, Ebn, Tbsys, Bsys, Rxn, Rbxn, Rxsys, Dn, 
Dsys, On and Pn(k) are the same as those defined in T-STD of ISO/IEC1381 88-1 . That is, n is an index number of an 
55 elementary stream and Tbn is a transport buffer of an elementary stream n. 

[0341] Mbn is a multiplex buffer of the elementary stream n, and is present only in a video stream. Ebn is an ele- 
mentary stream buffer in the elementary stream n. It is present only for the video stream. Tbsys is an input buffer for 
the system information in a program being decoded. Bsys is a main buffer in a system target decoder for the system 
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information for a program being decoded. Rbxn is a transmission rate with which data is removed from Mbn and is 
present only for a video stream. 

[0342] Rxsys is a transmission rate with which data is removed from Tbsys. Dn is a decoder in an elementary stream 
n. Dsys is s decoder pertinent to the system information of a program being decoded. On is a re-ordering buffer in a 

5 video stream. Pn(k) is a number k presentation unit in the elementary stream n. 

[0343] The decoding process for DVR-STD is explained. During the time a sole DVR MPEG-2 transport stream is 
being reproduced, the timing the transport packet is input to the Tbsys buffer is determined by the arrivaLtime_stamp 
of the source packet. The prescriptions for the decoding operation and the display operation are the same as those 
prescribed in ISO/IEC 13818-1 . 

10 [0344] The decoding process while the seamlessly connected Playltem is being reproduced is explained. Here, 
reproduction of two AV streams, referenced by the seamlessly connected Playltem, is explained. In the following ex- 
planation, the reproduction of the aforementioned TS1 And TS2, shown for example in Fig. 96, is explained. Meanwhile, 
TS1 and TS2 are a previous stream and a current stream, respectively. 

[0345] Fig. 105 shows a timing diagram for inputting, decoding and displaying a transport packet when transferring 
15 from a given AV stream (TS1 ) to the next AV stream (TS2) seamlessly connected thereto. During the time of transferring 
from a preset AV stream (TS1 ) to the next AV stream (TS2), connected seamlessly thereto, the time axis of the arrival 
time base of TS2 (indicated STC2 in Fig. 1 05) is not the same as the time axis of the arrival time base of TS1 (indicated 
STC1 in Fig.105). 

[0346] Moreover, the time axis of the system time base of TS2 (indicated STC2 in Fig.105) is not the same as the 
20 time axis of the system time base of TS1 (indicated STC1 in Fig.105). Video display is required to be seamlessly 
contiguous, while there may be overlap in the display time of the audio presentation unit. 

[0347] The input timing to the DVR-STD is explained. Until the time T1 , that is the time until the last video packet of 
TS1 is completely input to the TB1 of DVR-STD, the input timing to the TB1 , Tbn or TBsys of DVR-STD is determined 
by the arrival_time_stamp of the source packet of TS1 . 
25 [0348] The remaining packet of TS1 must be input to the buffer of Tbn or Tbsys at a bitrate of TS_recording_rate 
(TS1), where TS_recording_rate (TS1) is a value of the TS_recording_rate defined in the Cliplnfo() associated with 
the Clipl . The time the last byte of TS1 is input to the buffer is the time T2. Thus, in the domain from time T1 until time 
T2, arrival_time_stamp of the source packet is discounted. 

[0349] If N1 is the number of bytes of the transport packet next following the last video packet of TS1 , the time DT1 
30 from time T1 until time DT1 is the time required for N1 bytes to be completely input at a bitrate of TS_recording_rate 
(TS1), and is calculated in accordance with the following equation: 

DT1 = T2 - T1 = N1/TS_recording_rate (TS1). 

35 

[0350] During the time as from T1 until T2, the values of Rxn and Rxsys are both changed to the value of 
TS_recording_rate (TS1). The buffering operation other than this rule are the same as that of the T-STD. 
[0351] At time T2, the arrival time clock counter is reset to the value of arrivaLtime__stamp of the first source packet 
of TS2. The input timing of the TB1 , Tbn or TBsys to the DVR-STD is determined by the arrivaL_time_stamp of the 
40 source packet of TS2. The Rxn and Rxsys are both changed to values defined in T-STD. 

[0352] By way of explaining additional audio buffering and system data buffering, the audio decoder and the system 
decoder are required to have a surplus buffer quantity (approximately one second equivalent data volume) in addition 
to the buffer volume defined in T-STD. 

[0353] By way of explaining the video presentation timing, display of the video presentation unit must be contiguous, 
45 via junction, in a manner free of gap. It is noted that STC1 is the time axis of the TS1 system time base (shown as 
STC1 in Fig.105), while STC2 is the time axis of the TS2 system time base (shown as STC2 in Fig. 97). More accurately, 
STC2 begins at a time point the first PCR is input to T-STD. 

[0354] The offset between STC1 and STC2 is determined as follows: If PTS1 end is a PTS on STC1 associated with 
the last video presentation unit of TS1 , and PTS2start is PTS on STC2 associated with the first video presentation unit 
50 of TS2, while Tpp is the display period of the last video presentation unit of TS1 , the offset STC_delta between two 
system time bases may be calculated in accordance with the following equation: 

STC_delta = PTSIen + Tpp - PTS2start. 

55 

[0355] By way of explaining the audio presentation, there may be overlap of display timing of audio presentation 
unit, which overlap may be 0 to less than 2 audio frames (see "audio overlap" shown in Fig. 1 05). Which audio sample 
should be selected and re-synchronization of the display of the audio presentation unit to corrected time base at back 
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of the junction point are set on the player. 

[0356] By way of explaining the system time clocks of DVR-STD, the last audio presentation unit of TS1 is displayed 
at time T5. The system time clocks may be overlapped with between timeT2 and time T5. During this domain, DVR-STD 
is switched between the old time base value (STC1 ) and t the new time base value (STC2). The value of STC2 may 
be calculated in accordance with the following equation: 

STC2 = STC1 - STC_delta. 

[0357] The buffering continuity is explained. STC1video_end is the value of STC on the system time base STC1 
when the last byte of the last video packet of TS1 reaches TB1 of DVR-STD, while STC2video_end is a value calculated 
into a value on the system time base STC 2. The STC2video_end is calculated by the following equation: 

STC2video_end = STC1 video_end - STC_delta. 

[0358] In order that the DVR-STD will be obeyed, the following two conditions are required. First, the timing of arrival 
of the first byte of the first video packet of TS2 at the TB1 of the DVR-STD must satisfy the following inequality: 

STC22video_start > STC21 video_end + AT1 . 

[0359] If, in order that the inequality be met, the partial stream of Clipl and/or Clip2 is to be re-encoded and/or re- 
multiplexed, it is carried out as necessary. 

[0360] Next, on the time axis of the system time base in which STC1 and STC2 are converted on the same time 
axis, input of video packets from TS1 and the next following video packet input from TS2 must not overflow nor underflow 
the video buffer. 

[0361] Based on this syntax, data structure and rule, contents of data recorded on the recording medium or the replay 
information can be managed properly such that the user is able to confirm the contents of data properly recorded on 
the recording medium or reproduce desired data readily. 

[0362] Although the present embodiment is explained taking the MPEG-2 transport stream as a multiplexed stream 
as an example, this is merely exemplary such that the present embodiment may be applied to a MPEG2 program 
stream or to a DSS transport stream used in DirecTV service (trademark) of USA. 

[0363] The processing of locating reproduction of a scene represented by a mark point in case the syntax of 
mark_entry() and representative_picture_entry() is configured as shown in Fig. 81 is explained with reference to the 
flowchart of Fig. 106. 

[0364] First, at step S1 a controller 23 of a recording and/or reproducing apparatus 1 reads out the EP_map (Fig. 
70), STCJnfo (Fig.52) and ClipMark (Fig. 78) as database of the DVR transport stream from the recording medium 1 00. 
[0365] At step S2, the controller 23 formulates a list of a thumbnail from the picture referenced by 
representative __picture_entry (Fig. 81 ) or ref_thumbnail_index to output the so formed list from a terminal 24 as a user 
interface input/output for display on a menu of GUI. If. In this case, ref_thumbnaiUndex has an effective value, 
ref_thumbnail_index is prioritized with respect to representative_picture_entry. 

[0366] At step S3, a user designates a mark point of the reproduction start point. This is realized by the user selecting 
a thumbnail picture from the menu screen displayed as GUI. The controller 23 is responsive to this selecting operation 
to acquire a mark point associated with the designated thumbnail. 

[0367] At step S4, the controller 23 acquires PTS of mark_Time_stamp and STC_sequencJd of mark_entry (Fig. 
81) specified at step S3. 

[0368] At step S5, the controller 23 acquires, from STCJnfo (Fig.52), a source packet number at which to start the 
STC time axis associated with STC_sequence_id acquired at step S4. 

[0369] At step S6, the controller 23 acquires, from the packet number at which starts the STC time axis acquired at 
step S5 and from a PTS of the mark point acquired at step S4, a source packet number where there is the entry point 
(l-picture) temporally previous and closest to the PTS of the mark point. 

[0370] At step S7, the controller 23 reads out data of the transport stream from the source packet number containing 
the entry point acquired at step S6 to send the read-out data to the AV recorder 27. 

[0371] At step S8, the controller 23 controls the AV recorder 27 to start the display as from the picture of the PTS of 
the mark point acquired at step S4. 

[0372] The above-described operation is further explained with reference to Figs.107 to 109. 

[0373] It is assumed that a DV transport stream has CM (commercial) inserted in the DVR transport stream file has 
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a STC time axis of STC_EP__start of STC_sequence_id = id = idO, as shown in Fig. 107, and that the source packet 
number with which the time axis begins is smaller than the source packet number of the source beginning point A. It 
is also assumed that the CM (commercial) is added between the source packet numbers B and C. 
[0374] In this case, there are registered in EP_map, corresponding to the EP_map shown in Fig. 70, in association 
with A, B and C shown as RSPN_EP_start, respective PTSs, as PTS(A), PTS(B) and PTS(C), as shown in Fig. 108. 
[0375] Referring to Fig. 109, there are recorded mark_entry and representative_picture_entry, in the ClipMark of Fig. 
78, in association with the mark type values (Fig.79) of 0x92, 0x94 and 0x95 representing scene start, CM start and 
CM end, as shown in Fig. 109. 

[0376] As the Mark_time_stamp of mark_entry, PTS(a1), PTS(bO) and PTS(cO) are registered in association with 
the scene start, CM start and CM end, with the respective STC_sequence_id being idO. 

[0377] Similarly, as mark_time_stamp of the representative_picture_entry, PTS(a2), PTS(bO) and PTS(c) are regis- 
tered in association with the scene start, CM start and CM end, with the respective STC_sequence_id being idO. 
[0378] If PTS(A) < PTS(a1), a packet number A is acquired at step S6. At step S7, a transport stream beginning at 
packet number A is supplied to the AV decoder 27 and, at step S8, display is initiated as from the PTS(a1 ). 
[0379] Referring to the flowchart of Fig. 1 1 0, the processing of CM skip reproduction in case the syntax of mark_entry 
and representative_picture_entry is configured as shown in Fig. 81 . 

[0380] At step S21 , the controller 23 reads out from the recording medium 100 the EP_map (Ftg.70), STCJnfo (Fig. 
52), Programjnfo (Fig.54) and ClipMark (Fig.78). At step S22, the user designates CM skip reproduction from a ter- 
minal 24 as a user interface input/output. 

[0381] At step S23, the controller 23 acquires the PTS of the mark information as the CM start point (0x94) and 
associated STC_sequence_id (Fig. 81). 

[0382] At step S24, the controller 23 acquires, from STCJnfo (Fig.52), a source packet number of the CM beginning 
and end points, where begins the STC time axis corresponding to STC-sequence-id. 

[0383] At step S25, the controller 23 causes the transport stream to be read out from the recording medium to route 
it to the AV decoder 27 to start the decoding. 

[0384] At step S26, the controller 23 verifies whether or not the current displayed picture is a picture of the PTS of 
the CM start point. If the current display picture is not the picture of the CM start time point, the controller proceeds to 
step S27 to continue displaying the picture. The processing then reverts to step S25 to repeat subsequent steps. 
[0385] If, at step S26, the current displayed picture is verified to be a picture of the PTS of the CM start point, the 
processing transfers to step S28 where the controller 23 controls the AV decoder 27 to halt the decoding and display. 
[0386] At step S29, the controller 23 acquires the packet number where begins the STC time axis associated with 
the STC_sequence_id of the CM end point, while acquiring, from the packet number and the PTS of the CM end point 
acquired by the processing at step S23, the source packet number where there is an entry point temporally previous 
and closest to the PTS of the point. 

[0387] At step S30, the controller 23 controls the AV decoder 27 to reinitiate display from the picture of the PTS of 
the CM end point. 

[0388] At step S31 , the controller 23 controls the AV decoder 27 to reinitiate display from the picture of the PTS at 
the CM end point. 

[0389] Referring to Figs. 1 07 to 1 09, the above-described operation is explained further. The CM start time point and 
the CM end point are present in this embodiment on the common time axis of STC_sequence-id = idO. The source 
packet number where begins the STC time axis is selected to be smaller than the source packet number A of the scene 
start point. 

[0390] If the transport stream is decoded and, at step S26, the display time is found to be the PTS (bO), that is if the 
display time point is found to be CM start time point, display is halted by the AV decoder 27. If PTS(C) < PTS(c0) s 
decoding is started from the stream, beginning from the data of the packet number C at step S30. At step S31 , display 
is re-started from the picture of PTS(cO). 

[0391 ] This method can be applied not only to CM skip reproduction but also to skip reproduction of a scene between 
two points specified by the ClipMark in general. 

[0392] Referring to the flowchart of Fig. 11 2, the CM locating reproduction processing-indicated by the mark point, in 
case mark_entry and representative_picture_entry are of the syntax structure shown in Fig. 82, is explained. 
[0393] At step S41 , the controller 23 acquires the information of EP_map (Fig.70), STCJnfo (Fig.52), Programjnfo 
(Fig.54) and ClipMark (Fig.78). 

[0394] At step S42, the controller 23 generates a list of thumbnails from the picture referenced by the 
representative_picture_entry (Fig. 82) or the picture referenced by refjhumbnailjndex contained in the ClipMark (Fig. 
78), read out at step S41 , to display the so generated list on a menu screen. If the refjhumbnailjndex has an effective 
value, ref Jhumbnail_index is prioritized with respect to representative_picture_entry. 

[0395] At step S43, the user specifies a mark point of the replay start point. This designation is carried out e.g., by 
the user selecting the thumbnail picture from the menu screen displayed by the processing of step S42 to specify the 
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mark point associated with the thumbnail. 

[0396] At step S44, the controller 23 acquires the RSPr\l„ref„EP_start and offset_num_pictures (Fig. 82) of the mark 
point specified by the processing of step S43. 

[0397] At step S45, the controller 23 causes data of a transport stream to be read out from the source packet number 
5 corresponding to the RSPN_ref_EP_start acquired at step S44 to route it to AV decoder 27. 

[0398] At step S46, the controller 23 controls the AV decoder 27 to count up the pictures to be displayed, as from 
the picture referenced (without displaying) by the RSPN_ref_EP_start, and starts display as from the picture corre- 
sponding to the count value of offset_num_pictures. 

[0399] Referring to Figs. 1 1 3 to 1 1 5, the above-described processing is explained further. In the present embodiment, 
10 a scene is started in the DVR transport stream file such that the CM is inserted as from the source packet number B 
up to the source packet C. So, PTS(A), PTS(B) and PTS(C) are registered as PTS_EP_start in the EP_map, in asso- 
ciation with A, B and C as RSPN_EP_start, as shown in Fig. 11 4. 

[0400] Moreover, mark_entry and representative_picture_entry are registered in association with scene start, CM 
start and CM end, as shown in Fig.115. In mark_entry, A, B and C are registered as RSPr\Lref_EP_start, in association 
15 with scene start, CM start and CM end, while M1, N1 and N2 are registered as offset_num_pictures. Similarly, in 
representative _picture_entry, M2, N1 and N2 are registered as offset_num_pictures, in association with scene start, 
CM start and CM end, while M1 , N1 and N2 are registered as offset_num_pictures. 

[0401] If locating reproduction is commanded from a picture corresponding to scene start, the decoding is started 
from a stream beginning from data of a packet number A to count up pictures to be displayed, as from the picture PTS 
20 (A), without displaying, to start displaying as from a picture for which offset_num_pictures has assumed the value of M1 . 
[0402] The processing of CM skip reproduction in case the syntax of mark_entry and representative_picture_entry 
are of a structure shown in Fig. 82 is now explained by referring to the flowchart of Fig. 11 6. 

[0403] At step S61 , the controller 23 acquires the information on EP_map (Fig. 70), STCJnfo (Fig. 52), Program__info 
(Fig.54) and ClipMark (Fig. 78). 

25 [0404] If the user commands CM skip reproduction at step S62, the controller 23 acquires, at step S63, 
RSPN_ref_EP_start and offset_num„pictures (Fig. 82) as the mark information of respective points as CM start point 
and CM end point. The CM start point data is RSPN_ref_EP_start(1) and offset_num_pictures(1), whilst the CM end 
point data is RSPN_ref_EP_start(2) and offset_num_pictures(2). 

[0405] At step S64, the controller 23 acquires PTS corresponding to RSPN_ref_EP_start(1) and RSPN_ref_EP_start 
30 (2) from EP_map (Fig.70). 

[0406] At step S65, the controller 23 causes a transport stream to be read out from the recording medium 100 to 
send it to the AV decoder 27. 

[0407] At step S66, the controller 23 verifies whether or not the current display picture is the picture of PTS associated 
with RSPN_ref_EP_start(1) and RSPN_ref_EP_start(2) and, if the current picture displayed is not the picture of PTS 
35 associated with RSPN_ref_EP_start(1 ), it proceeds to step S67 to cause the picture to be displayed continuously. The 
controller 23 then reverts to step S65 to repeat the subsequent process. 

[0408] If, at step S66, the current picture displayed is the picture of PTS associated with RSPN_ref_EP__start(1 ), the 
controller 23 proceeds to step S68 to control the AV decoder 27 to count up the picture displayed as from the picture 
of the PTS associated with RSPN_ref_EP_start(1) to halt the display when the count value is offset_num_pictures(1). 
40 [0409] At step S69, the controller 23 causes transport stream to be read out beginning from the source packet number 
of RSPN_ref_EP_start(2) to route the read-out data to the AV decoder 27. 

[0410] At step S70, the controller 23 counts up the picture to be displayed, as from the picture of PTS associated 
with the RSPN_ref_EP_start(2) (without displaying) and, when the count value is offset_num_pictures(2), the controller 
causes display to be started at the corresponding picture. 
45 [0411] By way of explaining the above-described operation further by referring to Figs.113 to 115, the time PTS(B) 
and PTS(C) corresponding to the packet numbers B and C, respectively, are first obtained, based on EP_map (Fig. 
114). The Clip AV stream is decoded and, when the value is N1 (Fig. 15), the display is halted. 

[0412] Decoding is re-started as from the stream beginning from the data of the packet number C to count up the 
picture displayed, as from the picture of the PTS(C) and, when the value is N2(Fig.115), display is re-started as from 
50 the picture. 

[041 3] The above processing may be applied not only to a case of CM skipping reproduction but also to reproduction 
of a scene with skipping between two points specified by the ClipMark. 

[041 4] The processing of locating a scene specified by a mark point and reproducing the so located scene is explained 
with reference to the flowchart of Fig.1 1 8 in case the syntax of mark_entry and representative_picture entry elemen- 
ts tary stream is as shown in Fig. 84. 

[0415] At step S81, the information on EP_map (Fig.70), STCJnfo (Fig.52), Program Jnfo (Fig.54) and ClipMark 
(Fig.78) is acquired. 

[0416] At step S82, the controller 23 generates a list of thumbnails, referenced by representative_picture_entry or 
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reMhumbnail_index of ClipMark (Fig. 78) to display the so generated list as GUI menu picture. If the 
ref_thumbnail_index has an effective value, reMhumbnailJndex is prioritized with respect to 
rep res e ntati ve_p ict u recent ry . 

[0417] At step S83, a user specifies a mark point of the reproduction start point. This designation is made as the 
5 user selects a thumbnail picture from the menu picture and specifies the mark point associated with the thumbnail. 
[0418] At step S84, the controller 23 acquires the RSPN_mark_point (Fig. 84) specified by the user. 
[041 9] At step S85, the controller 23 acquires, from the EPjnap, the source packet number of the entry point previous 
and closest to the RSPN_mark_point of the mark_point (Fig. 70). 

[0420] At step S86, the controller 23 reads out data of the transport stream from the source packet number associated 
10 with the entry point acquired at step S85 to route the so read out data to the AV decoder 27. 

[0421] At step S87, the controller 23 controls the AV decoder 27 to start display as from the picture referenced by 
RSPN_mark _point. 

[0422] The above-described processing is explained in more detail by referring to Figs.119 to 121. In the present 
embodiment, the DVR transport stream file is such a one in which a scene starts with the source packet and CM is 
15 inserted as from the source packet number B to the source packet number C. So, PTS_EP__start are registered as 
PTS(A), PTS(B) and PTS(C), in association with A, B and C as RSPN_EP_start of Fig. 120. In the ClipMark shown in 
Fig. 21, a1, b1 and c1 are registered as RSPN_mark_point of maker entry, while a2, b1 and c1 are registered as 
RSPN_mark_point of rep rese ntati ve_picture_entry, in association with the scene start, CM start and CM end, respec- 
tively. 

20 [0423] If, in locating a picture at a scene start for reproduction, the packet number A is such that A < a1 , decoding 
is started as from a stream beginning from the data of the packet number A, such that display is started as from a 
picture corresponding to the source packet number a1 . 

[0424] The CM skipping reproduction processing in case the syntax of mark_entry and representative_picture_entry 
is as shown in Fig. 84 is now explained with reference to the flowcharts of Figs. 1 22 and 123. 
25 [0425] At step S101 , the controller 23 acquires the information on EP_map (Fig. 70), STCJnfo (Fig.52), Programjnfo 
(Fig.54) and ClipMark (Fig.78). 

[0426] At step S102, the user specifies CM skipping reproduction. 

[0427] At step S1 03, the controller 23 acquires the RSPN_mark_point (Fig. 84) of the mark information of each point 
for which the mark type (Fig. 79) is the CM start point or the CM end point. The controller 23 sets the data of the CM 
30 start point and data of the CM end point as RSPN_mark_point (1) and as RSPN_mark_point(2), respectively. 

[0428] At step S104, the controller 23 causes a transport stream to be read out from the recording medium 100 to 
output the read out stream for decoding. 

[0429] At step S105, the controller 23 verifies whether or not the current picture is a picture corresponding to 
RSPN_mark_point (1). If the current picture is not a picture corresponding to RSPN_mark_point (1), the controller 23 
35 proceeds to step S106 to continue to display the picture. The processing then reverts to step S11 to repeat the sub- 
sequent process. 

[0430] If, at step S1 05, the current display picture has been verified to be a picture associated with RSPN_mark_point 
(1 ), the processing transfers to step S1 07, where the controller 23 controls the AV decoder 27 to halt the decoding and 
display. 

40 [0431] Then, at step S1 08, the source packet number previous and closest to RSPN_mark_point (2) is acquired from 
EP_map (Fig. 70). 

[0432] At step S109, the controller 23 reads out the transport stream data from the source packet number associated 
with the entry point acquired at step S1 08 to route it to the AV decoder 27. 

[0433] At step S1 1 0, the controller 23 controls the AV decoder 27 to re-start the display as from the picture referenced 

45 by RSPN_mark _point (2). 

[0434] By way of explaining the above-described processing in more detail by referring to the embodiment of Figs. 
1 1 9 to 121, the Clip AV stream is decoded and display is halted at a display picture corresponding to the source packet 
number b1 (Fig.21). If the source packet number C < source packet number c1, decoding is re-started as from the 
stream beginning from the data of the packet number C, and display is re-started as from the stream beginning from 

50 the data of the packet number c1 . 

[0435] As described above, a preset location is specified on the Play List by a time stamp, which time stamp is con- 
verted in each Clip information of each Clip into a data address to have access to a preset position of the Clip AV stream. 
[0436] More specifically, if a book mark or a Resume point is specified by the user as PlayList mark as time stamp 
on the time axis, the PlayList on reproduction may use the ClipMark of the ClipMark being referenced by the PlayList 

55 to access the scene start or end point of the Clip AV stream. 

[0437] Meanwhile, the ClipMark syntax may be as shown in Fig. 126 instead of in Fig.78. 

[0438] In the present embodiment, RSPN_mark is inserted in place of reserved_for_MakerlD, mark_entry() and 
representative_picture_entry of Fig.78. The 32-bit field of this RSPN_mark denotes the relative address of the source 



33 



EP 1 280 348 A1 



packet containing the first byte of the access unit the mark is referencing. RSPN_mark is sized in terms of the source 
packet number as a unit. It is defined in the Clip Information file as from the first source packet of the AV stream file 
and is counted with the value of the offset_SPN as an initial value. 
[0439] The structure is otherwise the same as the structure of Fig. 78. 
5 [0440] The ClipMark syntax may further be configured as shown in Fig. 127. In the present embodiment, 
RSPN_ref_EP_start and offset_num_pictures are inserted in place of RSPN_mark in Fig.126. These are similar to 
those shown in Fig. 82. 

[0441] Fig. 128 is a flowchart for illustrating the preparation of ClipMark of the syntax shown in Fig. 81 in encoding 
and recording analog AV signals. Reference is made to the block diagram of Fig.1 showing a recording and/or repro- 
10 ducing apparatus 1 . At step S200, an analysis unit 14 analyzes input AV signals from terminals 11 , 12 for detecting 
characteristic points, which may be exemplified by a program locating point or a scene change point designating a 
characteristic scene pertinent to contents of an AV stream. 

[0442] At step S201 , the controller 23 acquires PTS of a picture at the characteristic point. At step S202, the controller 
23 stores the information on the characteristic point in ClipMark. Specifically, the information explained in connection 
15 with semantics and syntax of ClipMark of the present embodiment is stored. At step S203, the Clip Information file and 
the Clip AV stream file are recorded on a disc. 

[0443] Fig.1 29 is a flowchart for illustrating the preparation of ClipMark of the syntax of Fig. 81 in recording a transport 
stream input from a digital interface. Reference is had to the block diagram of the recording and/or reproducing appa- 
ratus 1 shown in Fig.1. At step S211 , the demultiplexer 26 and the controller 23 acquire the elementary stream PID of 
20 the program to be recorded. If there are plural elementary streams to be analyzed, the totality of elementary streams 
PIDs are acquired. 

[0444] At step S21 2, the demultiplexer 26 separates the elementary streams from the program of the transport stream 
input from the terminal 13. The AV decoder 27 decodes the elementary stream into AV signals. At step S213, the 
analysis unit 14 analyzes the AV signals to detect characteristic points. 
25 [0445] At step S21 4, the controller 23 acquires the PTS of the picture of the characteristic point and STC-sequence- 
id of the STC to which it belongs. Specifically, the controller 23 stores the information explained in the semantics and 
the syntax of the ClipMark in the present embodiment. 

[0446] At step S21 6, the Clip information file and the Clip AV stream file are recorded on the disc. 
[0447] As shown in the flowcharts of Figs. 128 and 129, the AV stream file, that is the ClipMark for storing the mark 
30 indicating the characteristic picture in the Clip AV stream file, is recorded in the management information data file in 
the AV stream file, that is in the Clip Information file. 

[0448] Fig.1 30 is a flowchart for illustrating the preparation of Real Play List. Reference is made to the block diagram 
of Fig.1 of the recording and/or reproducing apparatus 1 for explanation. At step S221 , the controller 23 records the 
Clip AV stream. At step S222, the controller 23 formulates Playl_ist() comprised of the Play Item covering the totality of 
35 reproducible range of the Clip. If there is a STC discontinuous point in the Clip and Playl_ist() is made up of two or 
more Playltems, connection_condition between the Playltems is also determined. 

[0449] At step S223, the controller 23 creates UIApplnfoPlayList(). At step S224, the controller 23 forms PlayListMark. 
At step S225, the controller 23 forms MakersPrivateData. At step S226, the controller 23 records the Real PlayList file. 
[0450] So, one Real PlayList file is created every time a Clip AV stream is newly recorded. 
40 [0451] Fig. 131 is a flowchart illustrating the formation of the Virtual PlayList. At step S231 , reproduction of one Real 
PlayList, recorded on the disc, is specified through a user interface. From the reproduction range of the Real PlayList, 
the reproduction domain indicated by IN and OUT points is specified. 

[0452] At step S232, the controller 23 verifies whether or not the designating processing for the reproduction range 
by the user has been finished in its entirety. If, atstepS232, it is verified that the operation of designating the reproduction 
45 range by the user has not been finished, processing reverts to step S231 to repeat the subsequent processing. If it is 
verified that the operation has been finished, processing transfers to step S233. 

[0453] At step S233, the connection condition (connection_condition) between two consecutively reproduced repro- 
duction domains is determined by the user over a user interface, or by the controller 23. If the user fails to produce a 
sub-path, the processing at step S234 is skipped. 
so [0454] At step S235, the controller 23 forms PlayListQ based on the reproduction range information specified by the 
user and on the connection_condition. At step S236, the controller 23 forms UlApplnfoPlayListQ. At step S237, the 
controller 23 forms PlayListMark. At step S238, the controller 23 forms MakerPrivateData. At step S239, the controller 
23 causes the Virtual PlayList file to be recorded on the disc. 

[0455] From the playback range of the RealPlayList, recorded on the disc, the user selects the playback range the 
55 or she desires to view, and one Virtual PlayList file is formed from one set of the playback ranges grouped together to 
another. 

[0456] Fig. 132 is a flowchart for illustrating the reproduction of the PlayList. Reference is had to the block diagram 
of the recording and/or reproducing apparatus 1 of Fig.1 . At step S241 , the controller 23 acquires Info.dvr, Clip Infor- 
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mation file, PlayList file and the information on the thumbnail file to form a GUI picture showing a list of the PlayList 
recorded on the disc for demonstration on the GUI. 

[0457] At step S242, the user commands the controller 23 to reproduce one PlayList through a user interface. At 
step S243, the controller 23 acquires a source packet number having an entry point temporally previous and closest 

5 to IN_timefrom the PTS of IN_time and STC-sequence-id of the current Playltem. At step S244, the controller 23 reads 
out the data of the AV stream from the source packet number to route the so read-out data to the AV decoder 27. 
[0458] If the Playltem is reproduced temporally before the above-mentioned Playltem, the controller 23 manages 
control at step S245 so that connection processing of display to the Playltem will be performed in accordance with 
connection_condition. At step S246, the AV decoder 27 starts display as from the picture of the PTS of IN_time. 

w [0459] At step S247, the AV decoder 27 continuously decodes the AV stream. At step S248, the controller 23 checks 
whether or not the picture currently displayed is a picture of PTS of OUT_time. If it is verified at step S28 that the picture 
currently displayed is a picture of the PTS of OUT_time, processing transfers to step S250 and, if otherwise, to step 
S249. 

[0460] At step S249, the processing of displaying the picture verified to be a PTS picture is executed. Then, process- 
's ing reverts to step S247 to repeat the subsequent processing. On the other hand, the controller 23 at step S250 verifies 
whether or not the Playltem is the last one in the current PlayList and, if the Playltem is verified to be the last one in 
the current PlayList, processing of the flowchart of Fig. 132 is terminated. If the Playltem is verified not to be the last 
one in the current PlayList, processing reverts to step S243 to repeat the subsequent processing. 
[0461 ] Fig. 1 33 is a flowchart for illustrating the preparation of PlayListMark. Reference is made to the block diagram 
20 of Fig.1 showing the respective apparatus 1. 

[0462] At step S262, reproduction of one PlayList is commanded by the user to the controller 23. At step S263, the 
reproducing unit 3 starts reproducing the designated PlayList, as explained with reference to the flowchart of Fig. 132. 
[0463] At step S264, the user commands the controller 23, through the user interface, the setting of a mark at a 
favorite scene. At step S265, the controller 23 acquires the PTS of the mark and the PlayList_id of the Playltem to 
25 which it belongs. 

[0464] At step S266, the controller 23 stores the mark information in the PlayListMark(). At step S267, the PlayList 
file is recorded on the disc. 

[0465] In this manner, the mark point specified by the user from the reproduction range of the PlayList, or the Play- 
ListMark, for storing the marks specifying the resume point in reproducing the ClipMark of the Clip, is recorded on the 
30 PlayList file. 

[0466] Fig. 134 is a flowchart for illustrating the locating reproduction employing the ClipMark of the Clip referenced 
by the PlayList and the PlayListMark in reproducing the PlayList. The syntax of C!ipMark() is as shown in Fig. 81. 
Reference is made to the block diagram of Fig.1 showing the recording and/or reproducing apparatus 1 for explanation. 
[0467] At step S271 , the controller 23 acquires the information of Info.dvr, Clip Information file, PlayList file and the 
35 information on the thumbnail file to create a GU I picture showing a list of PI ay Lists recorded on the disc for demonstration 
on the GUI. 61 through the user interface. 

[0468] At step S272, reproduction of one PlayList is commanded by the user through a user interface. At step S273, 
the controller 23 displays a list of thumbnails generated from a picture referenced by the ClipMark of the Clip referenced 
by the PlayList through the user interface on the GUI. 
40 [0469] At step S274, a mark point of the reproduction start point is specified to the controller 23 by the user. At step 
S275, the controller 23 verifies whether or not the mark selected by the processing at step S274 is the mark stored in 
the PlayListMark. If the selected mark is verified at step S275 to be a mark stored in the PlayListMark, processing 
transfers to step S276 and, if otherwise, to step S278. 

[0470] At step S276, the controller 23 acquires the PTS of the mark and the Playltem_id to which it belongs. At step 
45 S277, the controller 233 acquires the STC-sequence-id of the AV stream referenced by the Playltem indicated by the 
Playltemjd. 

[0471] At step S278, the controller 23 causes the AV stream to be input to the AV decoder 27, based on STC- 
sequence-id and the PTS of the mark. Specifically, the processing similar to that of steps S243 and S244of the flowchart 
of Fig.1 32 is performed, using the STC-sequence-id and the PTS of the mark. At step S279, the reproducing unit 3 

50 starts display as from the picture of the PTS of the mark point. 

[0472] When the PlayList is reproduced, the mark stored in the ClipMark of the Clip referenced by the PlayList can 
be referenced, as explained in connection with Fig. 9. Thus, if one clip is referenced by RealPlayList or plural Virtual 
PlayLists, these PlayLists can co-own the ClipMark of the Clip, so that mark data can be managed efficiently. 
[0473] If PlayListMark and ClipMark, combined together, are defined in the PlayList, without defining the ClipMark 

55 in the Clip, and if one Clip is referenced by Real PlayList or plural PlayLists, the respective PlayLists have the ClipMark 
information of the same contents, thus worsening the data recording efficiency. 

[0474] Fig. 135 shows another typical syntax of PlayListMark(). Length indicates the number of bytes as from the 
byte directly after the length field to the last byte of the PlayListMarkQ. Number_of_PlayList_marks denote the number 
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of entries of marks stored in the PlayListMark. 

[0475] Mark_invalid_flag is a one-bit flag and, if this value is set to 0, it indicates that this mark has the effective 
information. If the value is set to 1 , it indicates that this mark is effective. 

[0476] If the user has made an operation of erasing the entry of a mark on the user interface, the recording and/or 
reproducing apparatus 1 may change the value of the mark_invalid_flag to 1 instead of erasing the entry of the mark 
from the PlayMarkList. 

[0477] Mark_type denotes the mark type and has the meaning shown in Fig. 136. Markjriamejength denotes the 
byte length of the mark name indicated in the mark_name field. The value of this field is not larger than 32. 
Ref_to_Playltem_id denotes the value of Playltem_id specified by the Playltem where the mark is put. The value 
defined in Playl_ist() corresponding to a given Playltem is defined in the PlayList(), 

[0478] Mark_time_stamp stores the time stamp indicating the point specified by the mark. Mark_time_stamp specifies 
the time in the reproduction range specified in IN_time and OUT_time defined in the Playltem specified by the 
ref_to_Playltem_id. The meaning of time stamp is the same as that of Fig. 44. 

[0479] If entry_ES_PID is set to OxFFFF, the mark is a pointer on the common time axis common to the totality of 

elementary streams used by the PlayList. If entry„ES_PlD is set to a value other than OxFFFF, entry_ES_PID denotes 

the value of the PID of the transport packet containing the elementary stream specified by the mark. 

[0480] ReMhumbnaiUndex denotes the information of the thumbnail picture appended to the mark. The meaning 

of the information is the same as the ref_thumbnail_index. Mark_name denotes the mark name. The number of bytes 

specified by the mark_name_length from left of the field is the effective character letters, and denotes the name. The 

character letters are encoded by a method specified by the character_set in the U I App Info PlayList. 

[0481] The value of bytes next following the effective character letters in the mark_name field may be any value. In 

the case of this syntax, marks may denote the specified elementary stream. For example, if the PlayList references a 

multi-view program having plural video streams in the program, entry_ES_PID is used for setting the video PID denoting 

one video stream in the program. 

[0482] It is assumed that the user is reproducing the PlayList referencing the multi-view program, and that the user 
is seeing one view of the multi-view. It is also assumed that the user has sent a command for skipping the reproduction 
to the next mark point. In such case, the recording and/or reproducing apparatus 1 should use the mark of the 
entry_ES_PID of the same value as the video PID of the view currently seen by the user. The recording and/or repro- 
ducing apparatus should not change the view arbitrarily. The recording and/or reproducing apparatus 1 may use the 
mark set on the entry_ES_PID of OxFFFF In this case, the view is not to be changed arbitrarily by the recording and/ 
or reproducing apparatus 1 . 

[0483] Fig. 137 shows a modification of the ClipMark() of the syntax shown in Fig. 81 . Length denotes the number of 
bytes as from the byte directly following the length field to the last byte of the ClipMark(). If mark_type specifies a value 
from 0x60 to 0x7F, makeMD denotes the ID of the maker who has defined the mark_type. 

[0484] Number_of_Clip_marks denotes the number of entries of the marks stored in ClipMark. Mark_invalid_flag is 
a one-bit flag and, if its value is set to 0 or 1, it indicates that the mark has the effective information or is invalid, 
respectively. 

[0485] If the user has made an operation of erasing an entry of a mark on the user interface, the recorder may change 
the value of the mark_invalid_flag to 1 instead of erasing the mark entry from the ClipMark. Markjype indicates the 
mark type and has the meaning shown in Fig. 138. 

[0486] Ref_to_STC_id denotes the STC-sequence-id specifying the STC-sequence where both the 
mark_time_stamp and representative_picture_time_stamp are located. The value of STC-sequence-id is defined in 
STCInfo(). Mark_time_stamp has the same meaning as the mark_time_stamp in case of mark_entry() in Fig. 81. 
[0487] If entry_ES_PID is set to OxFFFF, its mark is a pointer to the time axis common to the totality of the elementary 
streams in the Clip. If entry_ES_PID is set to a value other than OxFFFF, entry_ES_PID denotes the value of PID of 
the transport packet containing the elementary stream specified by the mark. 

[0488] Ref_to_thumbnail_index denotes the information of a thumbnail picture appended to the mark. Its meaning 
is the same as that of reMhumbnailJndex of Fig. 78. Representative_picture_time_stamp has the same meaning as 
that of mark_time_stamp in the case of representative_picture_entry() of Fig. 81 . 

[0489] In the case of the syntax shown in Fig. 1 37, the mark may indicate a particular elementary stream. For example, 
if Clip contains a multi-view program having plural video streams in a program, entry_ES_PID is used for setting the 
video PID specifying one video stream in the program. 

[0490] It is assumed that the user is reproducing a PlayList referencing a multi-view program and that the user is 
seeing one view of the multi-view. It is also assumed that the user has sent to the recording and/or reproducing appa- 
ratus 1 a command of skipping-reproducing the next mark point. In such case, the recording and/or reproducing ap- 
paratus 1 should use the mark of entry_ES_PID, having the same value as that of the video PID of the view currently 
seen by the user, while the recording and/or reproducing apparatus 1 is not allowed to change the view arbitrarily. The 
recording and/or reproducing apparatus 1 may use the mark for which entry_ES_PID is set to OxFFFF. In such case, 
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the recording and/or reproducing apparatus again is not allowed to change the view at will. 

[0491] Based on the above-described syntax, data structure and rule, the contents of data recorded on the recording 
medium 100 or the replay information can be managed appropriately to reproduce the desired data extremely readily. 
[0492] With the above-described data structure, the PlayList file and the Clip Information file, for example, are sep- 
arated from each other and recorded in this form. Thus, if the contents of preset PlayList or Clip are changed on editing, 
there is no necessity of changing other files unrelated with the file. As a result, the file contents can be changed readily 
while shorter time may suffice for such changing or recording. 

[0493] If initially Info.dvr is read, the disc recording contents are presented to the user interface and only the PlayList 
file commanded for reproduction by the user and the Clip Information file pertinent thereto are read out from the disc, 
the user queuing time may be reduced. 

[0494] If the totality of the PlayList file or the Clip Information file are formed into one file and recorded into one file, 
the file size is increased considerably, and hence the time needed in changing the file contents for recording is signif- 
icantly larger than in the case of separating the respective files for recording. Such deficiency may be prevented from 
occurring by application of the present invention. 

[0495] As described above, the AV stream file, that is the ClipMark for storing the mark specifying a characteristic 
picture in the AV stream file, is recorded in the management information data file of the AV stream, while the mark point 
specified by the user from the reproduction range of the PlayList, that is the object having the information on the 
reproduction sequence defined by the combination of the domains specified in the AV stream, or the PlayListsMark for 
storing the mark indicating the Resume point in reproducing the object, is recorded on the object. 
[0496] By so doing, the mark stored in the ClipMark of the Clip referenced by the PlayList can be referenced in 
reproducing the PlayList. Therefore, these PlayLists are able to co-own one ClipMark when one Clip is referenced by 
the ReaiPlayList or plural Virtual PlayLists, so that mark data can be managed efficiency. 

[0497] If, in case ClipMark is not defined in Clip, but PlayMarkList and ClipMark, combined together, are defined only 
in the PlayList, one Clip is referenced by Real PlayList or plural Virtual PlayLists, the respective PlayLists own the mark 
information of the same contents. The data recording efficiency then is low. This can be prevented from occurring by 
application of the present invention. 

[0498] if EP_mark for storing the address of the entry point as the AV stream ancillary information and the ClipMark 
for storing the picture type of the mark point, such as program locating point, and the address in the AV stream of the 
picture, are rendered into a file and recorded in this form on the recording medium 100, the encoding information of a 
stream necessary for reproducing the AV file can be managed appropriately. 

[0499] By this Clip Information file information, the user is able to search a scene of interest from the AV stream 
recorded in the recording medium 100, such as program locating point, thus facilitating the decision of the readout 
position of the AV stream from the recording medium 1 00 responsive to the command for random access by the user 
or commands for special reproduction, while enabling stream decoding to be started promptly. 

[0500] The above-described sequence of operations can be executed not only on a hardware but also on a software. 
In this case, the recording and/or reproducing apparatus 1 can be formed by a personal computer such as is shown 
in Fig. 139. 

[0501] Referring to Fig. 139, the CPU (central processing unit) 201 executes a variety of processing operations in 
accordance with a program stored in a ROM (read-only memory) 202 or a program loaded from the memory unit 208 
to the RAM (random-access memory) 203, in which RAM data necessary in executing a variety of operations to be 
performed by the CPU 201 are stored. 

[0502] The CPU 201 , ROM 202 and RAM 203 are interconnected over a bus 204, to which bus there is also connected 
an input/output interface 205. 

[0503] To the input/output interface 205, there are connected an input unit 206 comprised of a keyboard and a mouse, 
a display made up of CRT and LCD, an output unit 207, such as a loudspeaker, a memory device 208, such as a hard 
disc, and a communication unit 209, made up e.g., of a modem or a terminal adapter. The communication unit 209 
performs communication processing over a network. 

[0504] To the input/output interface 205, there are connected a drive 21 0, a magnetic disc 221 , an optical disc 222, 
a magneto-optical disc 223 or a semiconductor memory 224, as necessary, while a computer program read out there- 
from are installed as necessary in the memory device 208. 

[0505] The above-described sequence of operations may be executed not only on a hardware but also on a software. 
If the sequence of operations is to be carried out on the software, the program forming the software or a variety of 
programs are installed on a dedicated hardware of a computer, such that the programs are installed form a recording 
medium, such as a general-purpose personal computer. 

[0506] Referring to Fig. 1 39, this recording medium may be constituted by a package medium distributed for furnishing 
the user with a program, in addition to a computer. The package medium may be exemplified by, for example, a magnetic 
disc 221 (inclusive of floppy disc), an optical disc 222 (inclusive of CD-ROM (Compact Disc-Read Only memory) and 
DVD (Digital versatile Disc)), a magneto-optical disc 223 (inclusive of MD (Mini-Disc)) or a semiconductor memory 
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224. In addition, the recording medium is constituted by a hard disc furnished to the user as it is pre-loaded on a 
computer and which includes a ROM 202 or a memory 208 having the program stored therein. 

[0507] In the present specification, the respective steps stating the sequence of the program furnished by a medium 
includes not only the processing executed chronologically in accordance with the stated order but also the processing 
executed in parallel or batch-wise. 

[0508] The system in the present specification denotes an entire apparatus made up of plural devices. 
Industrial Applicability 

[0509] In the information processing method and apparatus and the program, according to the present invention, 
ClipMark formed by a mark specifying a characteristic picture extracted from an input AV stream, as management 
information for supervising the AV stream, and PlayListMark, formed by marks pointing to a picture optionally specified 
by a user, from a reproduction domain corresponding to the PlayList, which defines the combination of preset domains 
in the AV stream, are generated, while the ClipMark and PlayListMark are recorded as respective independent tables 
on a recording medium. Thus, it becomes possible to access a desired location of an AV stream promptly and reliably. 
[0510] Moreover, in the information processing method and apparatus and the program, according to the present 
invention, the management information for supervising the AV stream including ClipMark formed by marks pointing to 
a characteristic picture extracted from an AV stream, is read out, and PlayListMark formed by a mark pointing to a 
picture optionally specified by a user is read out from a reproduction domain corresponding to the PlayList which defines 
the combination of preset domains in the AV stream. Moreover, the management information read out by the readout 
step and the information by the PlayListMark are presented, the ClipMark corresponding to the PlayList commanded 
to be reproduced by the user is referenced from the information presented by the presentation means, and the AV 
stream including the ClipMark, referenced by the referencing step, is reproduced from a position corresponding to the 
ClipMark. Thus, it becomes again possible to access a desired location of an AV stream promptly and reliably. 



Claims 

1. An information processing apparatus comprising: 

generating means for generating ClipMark formed by a mark specifying a characteristic picture extracted from 
an input AV stream, as management information for supervising the AV stream, and for generating PlayList- 
Mark, formed by marks pointing to a picture optionally specified by a user, from a reproduction domain corre- 
sponding to the PlayList defining the combination of preset domains in said AV stream; and 
recording means for recording said ClipMark and PlayListMark as respective independent tables on a recording 
medium. 

2. The information processing apparatus according to claim 1 wherein said generating means generates said Clip- 
Mark and said PlayListMark as a Cliplnformation file and as a PlayList file, respectively. 

3. The information processing apparatus according to claim 1 characterized in that said PlayListMark further in- 
cludes a mark specifying a Resume point in reproducing said PlayList. 

4. The information processing apparatus according to claim 1 wherein, in reproducing said PlayList, said mark forming 
the ClipMark of said AV stream corresponding to the reproduction domain of said PlayList is referenced. 

5. The information processing apparatus according to claim 1 wherein said mark of the PlayListMark includes pres- 
entation time stamp and the identification information for indicating one reproduction domain specified on said AV 
stream data forming the reproduction route of said PlayList. 

6. The information processing apparatus according to claim 1 wherein said mark forming the ClipMark or said mark 
forming said PlayListMark includes the information specifying an entry point of an elementary stream. 

7. The information processing apparatus according to claim 1 wherein said mark of the PlayListMark includes the 
information of the type at least including the Resume point of the start point of a favorite scene or a Resume point 
of the PlayList specified by a user. 

8. The information processing apparatus according to claim 1 wherein said mark of the ClipMark and said mark of 



38 



EP 1 280 348 A1 



the PlayListMark are represented by an address of a relative source packet corresponding to an entry point of said 
AV stream. 

9. The information processing apparatus according to claim 8 wherein said mark of the ClipMark and said mark of 
5 the PlayListMark are represented by a first address of a relative source packet corresponding to an entry point of 

said AV stream and by a second address of an offset from said first address, respectively. 

10. The information processing apparatus according to claim 1 further comprising: 

10 type detection means for detecting the type of said characteristic picture detected at the time of recording by 

said first recording means; 

said first recording means recording said mark forming said ClipMark and said type as detected by said de- 
tection means. 

15 11. The information processing apparatus according to claim 1 wherein said mark of said ClipMark includes a scene 
change point, a commercial start point, commercial end point or a title displaying scene. 

12. An information processing method comprising: 

20 a step of generating ClipMark formed by a mark specifying a characteristic picture extracted from an input AV 

stream, as management information for supervising the AV stream, and of generating PlayListMark, formed 
by marks pointing to a picture optionally specified by a user, from a reproduction domain corresponding to the 
PlayList defining the combination of preset domains in said AV stream; and 

a step of recording said ClipMark and PlayListMark as respective independent tables on a recording medium. 

25 

13. A recording medium having recorded thereon a computer-readable program, said computer-readable program 
including 

a step of generating ClipMark formed by a mark specifying a characteristic picture extracted from an input 
AV stream, as management information for supervising the AV stream, and of generating PlayListMark, formed by 
30 marks pointing to a picture optionally specified by a user, from a reproduction domain corresponding to the PlayList 

defining the combination of preset domains in said AV stream; and 

a step of recording said ClipMark and PlayListMark as respective independent tables on a recording medium. 

14. A program for having a computer execute 

35 a step of generating ClipMark formed by a mark specifying a characteristic picture extracted from an input 

AV stream, as management information for supervising the AV stream, and of generating PlayListMark, formed by 
marks pointing to a picture optionally specified by a user, from a reproduction domain corresponding to the PlayList 
defining the combination of preset domains in said AV stream; and 

a step of recording said ClipMark and PlayListMark as respective independent tables on a recording medium. 

40 

15. An information processing apparatus comprising: 

readout means for reading the management information for supervising said AV stream including ClipMark 
formed by marks pointing to a characteristic picture extracted from an AV stream, and also for reading, from 
45 a reproduction domain corresponding to the PlayList defining the combination of preset domains in said AV 

stream, PlayListMark formed by a mark pointing to a picture optionally specified by a user; 
presentation means for presenting said management information read out by said readout means and the 
information by said PlayListMark; 

referencing means for referencing, from said information presented by said presentation means, said ClipMark 
so corresponding to the PlayList commanded to be reproduced by the user; and 

reproducing means for reproducing said AV stream including said ClipMark, referenced by said referencing 
means, from a position corresponding to said ClipMark. 

16. The information processing apparatus according to claim 15 wherein said presentation means presents a list by 
55 said thumbnail picture associated with said PlayListMark. 

17. The information processing apparatus according to claim 15 wherein said mark corresponding to said ClipMark 
and said mark corresponding to said PlayListMark are represented by an address of a relative source packet 
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corresponding to an entry point of said AV stream. 

18. The information processing apparatus according to claim 17 wherein said mark of the ClipMark and said mark of 
the PlayListMark are represented by a first address of a relative source packet corresponding to an entry point of 
said AV stream and by a second address of an offset from said first address, respectively. 

19. The information processing apparatus according to claim 15 wherein said mark of said ClipMark includes a scene 
change point, a commercial start point, a commercial end point or a title displaying scene. 

20. An information processing method comprising: 

a readout step of reading the management information for supervising said AV stream including ClipMark 
formed by marks pointing to a characteristic picture extracted from an AV stream, and also for reading, from 
a reproduction domain corresponding to the PlayList defining the combination of preset domains in said AV 
stream, PlayListMark formed by a mark pointing to a picture optionally specified by a user; 
a presentation step of presenting said management information read out by said readout step and the infor- 
mation by said PlayListMark; 

a referencing step of referencing, from said information presented by said presentation step, said ClipMark 
corresponding to the PlayList commanded to be reproduced by the user; and 

a reproducing step of reproducing said AV stream including said ClipMark, referenced by said referencing 
step, from a position corresponding to said ClipMark. 

21. A recording medium having recorded thereon a computer-readable program, said computer-readable program 
including 

a readout step of reading the management information for supervising said AV stream including ClipMark 
formed by marks pointing to a characteristic picture extracted from an AV stream, and also for reading, from a 
reproduction domain corresponding to the PlayList defining the combination of preset domains in said AV stream, 
PlayListMark formed by a mark pointing to a picture optionally specified by a user; 

a presentation step of presenting said management information read out by said readout step and the infor- 
mation by said PlayListMark; 

a referencing step of referencing, from said information presented by said presentation step, said ClipMark 
corresponding to the PlayList commanded to be reproduced by the user; and 

a step of recording said ClipMark and PlayListMark as respective independent tables on a recording medium. 

22. A program for having a computer execute 

a readout step of reading the management information for supervising said AV stream including ClipMark 
formed by marks pointing to a characteristic picture extracted from an AV stream, and also for reading, from a 
reproduction domain corresponding to the PlayList defining the combination of preset domains in said AV stream, 
PlayListMark formed by a mark pointing to a picture optionally specified by a user; 

a presentation step of presenting said management information read out by said readout step and the infor- 
mation by said PlayListMark; 

a referencing step of referencing, from said information presented by said presentation step, said ClipMark 
corresponding to the PlayList commanded to be reproduced by the user; and 

a step of recording said ClipMark and PlayListMark as respective independent tables on a recording medium. 

23. A program having recorded thereon, as independent tables, the management information for supervising said AV 
stream including ClipMark formed by marks pointing to a characteristic picture extracted from an AV stream, and 
also for reading, from a reproduction domain corresponding to the PlayList defining the combination of preset 
domains in said AV stream, PlayListMark formed by a mark pointing to a picture optionally specified by a user. 
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